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Traffic Capacity of a Probability-Engineered 
| Trunk Group 


By D. W. HILL and S. R. NEAL 
(Manuscript received January 22, 1976) 


The grade of service of a probability-engineered trunk group vs defined 
to be the 20-day average blocking during the busy hour of the busy season. 
In this paper, an wmproved model for calculating grade of service is de- 
veloped and used to tncrease the accuracy of the existing trunk-engineering 
procedures. Using the new model, new traffic-capacity tables and trunk- 
estimation algorithms have been designed for use in the Bell System. 


]. INTRODUCTION 


The grade of service for a probability-engineered trunk group is 
defined to be the average blocking observed in the time-consistent 
busy hour of the busy season.* The existing methods for predicting 
grade of service do not account for the effects of the finite length of the 
individual one-hour measurement intervals and thereby tend to under- 
estimate trunk-group capacity. 

In this paper, we develop an improved model for calculating average 
blocking that includes the two essential effects of the finite measure- 
ment interval. First, the current method for estimating the mean 
blocking for a single hour must be revised to remove an implicit 
assumption that the measurement interval is infinite. Second, the 
existing mathematical model for day-to-day variation of trunk-group 


*Both the CCITT (International Telegraph and Telephone Consultative Com- 
mittee) and the Bell System define grade of service as an unweighted average of 
busy-hour busy-season blocking values. 
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offered loads must be modified to account for statistical measurement 
error, which is also introduced by the finite measurement interval. 

We develop a new model of day-to-day load variation in Ap- 
pendix A and in Section II combine it with a new estimate of mean 
single-hour blocking to obtain our approximation for the average 
blocking. This new approximation is then compared with the existing 
approximation analytically and numerically; the accuracy of the new 
approach is established in the third section using data from a computer 
simulation. A summary is given in the last section. 


Il. AVERAGE BLOCKING 


In this section, we develop an approximation for the average busy- 
season busy-hour blocking. Since the measured grade of service is de- 
termined from blocking measurements made over several hours, and 
since the busy-hour source loads vary from day to day, our analysis 
must account for the effects of such load variation. Accordingly, we 
first discuss day-to-day load variation and then describe our approxi- 
mation for average blocking. 


2.1 Day-to-day load variation 


R. I. Wilkinson was the first author to study the impact of day-to- 
day variation in offered loads on trunk-engineering procedures.'” He 
collected data from a number of trunk groups that indicated that the 
distribution of the observed loads could be approximated by a gamma 
distribution. The data also indicated that the variance, Var (@), of 
the observed load, &, was related to the mean @ by 


Var (4) = 0.133%, (1) 


where ¢ is a parameter whose value depends on local conditions. 

Wilkinson’s studies showed that Var (@) tends to be relatively larger 
for overflow traffic than for first-routed (Poisson) traffic. Those results 
led to the specification of three values of ¢ (1.5, 1.7, and 1.84) to cover 
a reasonable range of engineering applications. The level of day-to-day 
variation is called low when ¢ = 1.5 is appropriate, medium for 
@ = 1.7, and high when ¢ = 1.84. 

In Appendix A, we show that the variance of the observed ‘‘single- 
hour” offered loads is a sum of two components: 

2 202 
Var (&) = Why + Var (a), (2) 

where @ is the daily source load (a random variable), & is the observed 
load, a = Ea) is the average busy-season load, z is the traffic peaked- 
ness, t is the observation interval (usually one hour), and A is the 
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mean holding time. The first term is an approximation for the variance 
arising from the finite measurement interval* and Var (a) is the vari- 
ance of the daily source load. 

Combining eqs. (1) and (2), we obtain a model relating the source- 
load variation to the observed load variation; i.e., 


2az : 
(¢/h) 
For certain combinations of the various parameters, 2a@z/(é/h) can ex- 
ceed 0.13a%, indicating that the observed load variation is entirely 


due to random measurement error resulting from the finite measure- 
ment interval. For our application, we will assume that 


Var (a) = 0.13a* — (3) 


2az 


Var (a) = max 30, 0.138a* — (hy 


(4) 


2.2 The approximation 


Consider a service system with c servers having exponentially dis- 
tributed service times with mean h and serving traffic under a blocked- 
calls-cleared service discipline. The system is observed during n dis- 
joint measurement intervals J,, ---, Z,, each of length t. During J;, 
the interarrival times are independent and identically distributed (iid) 
with mean 1/d;. The peakedness, z, of the traffic is assumed to be the 
same during all the intervals.t The system is in statistical equilibrium 
during each interval and the initial point of each interval is a stationary 
(random) point for the arrival process. The loads a; = \;h,7 = 1, °°-, 7, 
are independent and identically distributed according to the distribu- 
tion function I'(@|&, va) with mean & and variance va = Var (a) (the 
day-to-day source-load variance). [We assume that I'(a|&, va) is a 
gamma distribution. Justification for the assumption is given below. | 

We use A;(t) and O;(é) to denote, respectively, the number of arrivals 
(call attempts) and the number of blocked attempts (the overflows) 
during J;; the (measured) observed blocking is B; = O,(t)/A;(t) pro- 
vided A,(t) #0. If A;() = 0, then O;(é) = 0 and the ratio is not 
defined. However, if no arrivals occur during J,;, it seems appropriate 
to say that no blocking occurred; i.e., we define B; = 0 whenever 
- A;(t) = 0. The sample average of the observed blocking (the observed 
grade of service) is 


B, = 


Sir 
Ms 


Bj. 
j=l 

* This is an extension of the formula given by Riordan in Ref. 3. 

' Traffic studies have shown that the peakedness of the traffic offered to a final 
trunk group does not change significantly from day to day during the busy hour of 
the busy season. 
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Since a1, -°-, @, are lid, B,, ---, B, are also iid. Consequently, the 
average blocking B is given by 


B = E(B,) = E(B). (5) 


The hourly measurement B, represents a random sample of observed 
blocking corresponding to a load population with mean a;. The 
parameter qa, is a random variable with mean @ and distributed accord- 
ing to '(a;|&, va). Thus, 


Be / “EB | Of) 
0 1 
Dropping the subscripts, the conditional mean in (6) is given by 


E a a| = Pr {A(t) > Ola}E | oo) 








ca} dT (ax|a, v2). | (6) 





A(t) A (é) 


where Pr (X) denotes the probability of the event X. Let 
A = E{A(t)|a} and O = E{O(#t)|a}. Also, for nonnegative m and n, 


a, A(t) > of 


“CAO _ E{LOWILAWF a3 
E{LO@ LAMP la, AW > 0} =  Prizg > ole” 

Using these relations, an approximation for the conditional expecta- 
tion is obtained by expanding the function x/y in a two-dimensional 
Taylor Series about the point (xo, yo) = (H{O(t)|a, A(t) > 0}, 
E{A(t)|a, A(t) > 0}). Taking the appropriate conditional expectation 
and retaining only the terms up through second order, we have the 
approximation 


B) oO O(t) 





oo 0 +(= {A(t) > Ola} ) 


| E{[A (t) Fle} -{ A | 
Pr {A(d) > 0] a} Pr {A(t) > O/ a} 
= (Fee ley 
A 
( E{AO(i)|a} _ AO ). (7) 
Pr {A() > Oa} (Pr {A@® > Ola})? 
Denoting the call congestion O/A by B(c, a, z) and noting that 
E{A(t)|a} = at/h, eq. (7) reduces to 
Pr {A(t) > Oa} 
(at/h)? 


- [B(c, a, 2) Var {A(t)|a} — Cov {A(@), O@|a}], 


E {38 a, A(t) > o| CoB een e\ 
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where all of the statistical moments are functions of a, 2, and c. Thus, 


b{ 2Mla| nePr AW > ola)B(e 2,0) + (PAW 2 Olel 


A(t) 
- [B(c, a, 2) Var {A()|a} — Cov {A(), O@|a}]. (8) 


Formulas for computing the moments are given in Ref. 4. An expres- 
sion for Pr {A(t) > 0|a} is given in Appendix B. Numerical experi- 
mentation has shown that Pr {A(t) > O|a} 1 for at/h > 10; i.e., 
that term can be ignored except for very small loads or short measure- 
ment intervals. Combining eqs. (6) and (8), we have 


Br . Pr {A(t) > 0]a}B(c, «, 2)d(ala, va) 


at is Ric, a, zal (ala, va), (9) 


where 
R(c, Qa, Z) _ (= aa oie ) 
- [B(c, a, z) Var {A() |e} — Cov {A(@), O@ |e} J. 


The approximation is complete when [I is specified. 

Numerical experimentation has shown that B is not very sensitive 
to the shape of I'(a@|&, va) for fixed values of &@ and vg. Accordingly, 
following Refs. 1 and 2, we have assumed that I'(a@|@, va) is a gamma 
distribution with mean & and variance vg, where vg = Var (a) is given 
by eq. (4). With these assumptions, the integrals in (9) can be com- 
puted numerically using a 5l-point compound Simpson’s rule. The 
accuracy of the approximation is discussed in Section III. In Section 
2.3, We compare this approximation with the existing procedure. 


2.3 The existing procedure 


The existing approximation for average blocking is!? 
B.(c, &, 2) & / B(c, a, 2)al (al, v), (10) 
0 


where 

v = Var (@) = 0.184. 
Consequently, there are two essential differences between the present 
method (10) and our approximation (9). 

First, the existing method neglects the random component of load 
variation and assumes that all of the variation in the observed loads 
is due to day-to-day changes in the source load; 1.e., the assumed load 
variation is too large by an amount 2@2/(t/h). The integral in (10) is 
an increasing function of v in the range of engineering interest (less 
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than 5 percent average blocking) and, hence, B, is larger than it would 
be if vg were used in place of »v. 

Comparing (9) and (10), we see the second difference. The existing 
procedure implicitly uses E{[O(t)/A (t)]|a} & B(c, a, z) and neglects 
the term R(c, a,z). Numerical experimentation has shown that 
R(c, a, 2) is negative in the range of engineering interest and that 
|R(c, a, z)| is an increasing function of z. For z near 1, R(c, a, 2) 
is negligible. However, for z = 2 R(c, a, 2) becomes significant. Con- 
sequently, B(c, a, 2) is larger than E{[O(t)/A(t)]|a}, and the differ- 
ence increases with z. 

The two differences combine to cause B < B, in all regions of 
engineering interest; i.e., the average of the observed single-hour 
blocking probabilities is less than that predicted by the existing method 
(10). Quantitative comparisons are made in Section III. 


lil. NUMERICAL RESULTS 


To determine the accuracy of our approximation (6) relative to the 
existing procedure (10), a computer simulation was constructed for a 
full-access trunk group satisfying the assumptions specified in Section 
2.2. The simulation was run for a large range of the system parameters 
c, &, 2, and ¢ covering the regions of engineering interest (B & 0.01, 
c= 2,127). Generally, a mean holding time of 180 seconds 
was used (although both smaller and larger values were used for sen- 
sitivity tests)* and 20-day averages were generated. All statistics are 
based on a sample size of 50; 1.e., 1000 simulated hours. 


3.1 Simulation output 


Typical results from the simulation are summarized in Tables I and 
II. First consider Table I. The first four columns are the input parame- 
ters for the simulation. In order, they are the peakedness z, the offered 
load &, the trunk-group size c, and the conventional exponent ¢ defining 
the level of day-to-day variation. [That is, Var (a) was adjusted so 
as to produce the desired ¢, as discussed in Section 2.2.] The next 
three columns give the simulated 20-day average blocking B., the 
variance v, of the daily offered source loads, and the (total) variance 
vg of the observed loads. The measurement variance vz is then com- 
puted as vz = v4 — vq, and is compared with 2az/(é/h) in the last two 
columns. In the cases shown, 2az/(t/h) is an adequate approximation 
of the variance introduced by a finite measurement interval. The other 


*The holding time was varied from 100 to 360 seconds and a small effect was 
observed. The effect is negligible for most engineering applications. Based on a survey 
of observed holding times, AT&T has requested that h = 225 seconds be used for 
the new traffic tables. 
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Table | — Simulation results 


Simulation Input Observed Data 
ae ‘ Theoretical 
Peaked-| Mean T Variation Average Input Observed | Residual : 
runks : : : : Variance 

ness Load (c) Parameter | Blocking | Variance | Variance | Variance Qaz 
(z) (a) (¢) (Bo) (va) (v&) (va) (= 
t/h 

1.0 4.01 10 1.5 0.0083 0.72 1.03 0.31 0.40 
4.0 17.80 40 1.5 0.0084 4.81 12.54 7.73 7.12 
4.0 9.80 30 1.7 0.0049 3.60 6.61 3.01 3.92 
7.0 9.75 40 1.84 0.0046 1.76 8.96 7.10 6.82 


test cases indicated that the accuracy of the approximation generally 
increases aS & increases (as one would expect from the asymptotic 
nature of the approximation). 


3.2 Existing approximation 


The data in the first seven columns of Table II illustrate the size 
of the bias in the existing method. The first five columns of the table 
are the same as those in Table I. The next column B, represents the 
computed average blocking corresponding to the source load, peaked- 
ness, trunk-group size, and respective day-to-day variance used in the 
simulation. The next column é, gives the corresponding estimate (using 
the existing method) of the trunk-group size necessary to achieve the 
simulated blocking for the given input load, peakedness, and day-to- 
day variation parameter ¢. 

First, note the difference between By and B,. The existing approxi- 
mation B, is always larger than the actual average blocking B,, and 
the relative difference increases as z increases. The bias in B, will 
cause the engineering estimates of trunks required (to meet objective 


Table Il— Engineering methods 


Simulation Engineering Methods 
Input Output Existing Method New Method 
oe Simulated , F 

Peaked-| Mean Perak Variation N Blocking Trunk Blocking Trunk 
ness Load (c) | Parameter Sask Estimate | Estimate | Estimate | Estimate 
@ | @) () (a) | (Bed (¢e) (B) (é) 
1.0 4.01 10 1.5 0.0083 0.0100 10.24 0.0078 9.91 
4.0 17.80 40 1.5 0.0084 0.0145 42.83 0.0083 39.97 
4.0 9.80 30 1.7 0.0049 0.0103 32.96 0.0054 30.31 
7.0 9.75 40 1.84 0.0046 0.0106 44.71 0.0045 39.93 
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service) to be too large. The bias in the trunk estimates is illustrated 
by comparing c with é,.* 

In all the cases we considered, the bias was primarily a function of 
z. For z = 1, the bias was generally less than one trunk. For z = 2, 
é. — c was usually between one and two trunks, and, for z = 4, the 
bias ranged between three and five trunks. The corresponding relative 
errors (c — é.)/c were largest at the smaller values of c. 


3.3 New approximation 


The relative accuracy of the new approximation is illustrated in the 
last two columns of Table II. In these cases, the new approximation 
for B is much closer to the simulated blocking B,. The relative differ- 
ences are quite small, especially when compared with the correspond- 
ing errors in the existing approximation. Since the estimate of B is 
good, the corresponding estimate é of trunks required to achieve B, 
is quite close to c, the number actually required. 

Similar results were obtained in all the other test cases. Accordingly, 
we conclude that the existing approximation (10) is biased, but the 
bias is essentially removed by using the new approximation (9). 

Based on these results, the new approximation has been used to 
generate new trunk-engineering tables and algorithms for use by the 
Bell System. 


IV. SUMMARY AND CONCLUSIONS 


By combining a new model for day-to-day load variation with a 
new estimate of mean single-hour blocking, a new approximation was 
obtained for estimating the grade of service for probability-engineered 
trunk groups. Using this result, it has been possible to improve the 
accuracy of the presently recommended trunk-engineering procedures 
and thereby realize an increase in predicted trunk-group capacities. 
The increases are smaller for trunk groups serving Poisson traffic, 
but become substantial as the peakedness and level of day-to-day 
variation increase. The new approximation has been used to develop 
new trunk-engineering tables and algorithms that will soon be intro- 
duced into the Bell System. 


APPENDIX A 
Day-to-Day Load Variation 

Let S denote a GI/M/ service system which is observed over k 
disjoint intervals of time {J,, ---, Zz}, each of length ¢.t The inter- 


* Refer to Section 2.3 for a discussion of the individual components of the difference. 
The infinite server system is a standard model for traffic engineering. It is used to 
characterize a traffic process in terms of only interarrival times and service times. 
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arrival times during J; are independent and identically distributed 
(iid) according to the probability distribution function F; with mean 


1 [oe] 
vie I tdF;(t). 


The arrival rates {\i, ---, A} are lid with mean H(A) and variance 
Var (A). The peakedness (variance-to-mean ratio) of the arrival process 
is constant for all intervals. The service times are lid according to a 
negative-exponential distribution with mean h. The system is assumed 
to be operating in statistical equilibrium at the beginning of each 
interval; i.e., the initial point of each interval is a stationary point for 
the arrival process (see Ref. 3). 

Let N; be the number of arrivals during J; and let £;; denote the 
service time of the 7th arrival in J;. The average usage during 
I = >¢%., 1; is estimated by* 


my k 


I is 


(We assume the edge effects are ane since S is in equilibrium 
during each interval J;.) The corresponding estimate of average offered 
load is 
In this section, we obtain the mean and variance of @x. 

From Ref. 5 and eqs. (11) and (12), we have 


. 1 k Nj; 


1 k 
ki > . {AN ;} 


h k 
i XNA. (13) 


Since the beginning of each interval is a stationary point for the arrival 
process, it follows that E{N,;|\;} = A, (see Ref. 6). Thus, for the 
arrival process, it follows that 


E{N;} = BEAN |N}} 
= E {h;t} 
Ni 
= E{att 


*Our studies have shown that the additional measurement variance caused by 
discretely sampling the usage with a 100-second-scan Traffic Usage Recorder was 
negligible when compared with the variance caused by the load variation. 
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and so from (13) we have 
E{é.} = E{y}h. 
Since the &;; are independent, eqs. (11) and (12) yield 


Ni 
Var (a) = cay 2 = Var ee tu , (14) 
and from Ref. 5, 


Nj Ni 
var | 3 > tu} = B {Var | oz Euldsl| + Var {z | 2 bul (15) 
i=] 3 N; | t=1 
We first expand 
Ni Ni 
Var | 2 Euilds| = Var {z | D &e3|Ns, aif} 
n; | ¢=1 Nj i=l 


Ni 
ote B {Var | pa Eij|N,A3 i} 
Nj = 
= Var {N jh|d3} + E {N jh? |); } 
Nj Nj 
Var {Nj] Aj} ): 
EXN3|X3} 
Let z denote the traffic peakedness (which is constant over J). Then 
from Ref. 7, we have the approximation 
Var {N;|A3} 
EXN 5X3} 


the approximation has been found to be quite good for a > (2 — 1) 
and t 2 10h (and probably acceptable for engineering purposes for 
t = 5h). Substituting (17) into (16), we obtain 


= WA;t (1 a (16) 


mw 22-1; (17) 


Ni 
Var | py Bula} = 2zh7tn;. (18) 
Nj t=] 


Expanding the second term in (15) in the same manner as in eq. 
(13) yields 





Bx Ex) 03 | = hthj;. (19) 
Substituting (18) and (19) in (15) provides 
Ni 
Var | y Eu] = QzhtH{d} + (ht)? Var {Xr}, (20) 
i=1 
which, with eq. (14), implies that 
Var (&:) = (AY + h? Var i): (21) 
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The offered load during J; is a; = jh. Hence, E{a;} = hE{} and 
Var {a;} = h? Var {\}. Dropping the subscript j, we have 


i (ee 
k\ t/h 





Varies + Var {a} ); (22) 

Equation (22) has a simple interpretation. The first term (22H {a})/ 
(t/h) represents the component (of the observed variation) which is 
due to a finite measurement period. The second component Var {a} 
is the ‘“‘true’’ (day-to-day) variation of the offered source load. The 
factor k results from averaging k observations. Setting s = 1 and 


a = E{a}, we have the variance of the single-hour load estimate 


_ 22% 


Var {a} ~ Th 


+ Var {a}, (23) 


where the individual terms have the interpretations noted above. 


APPENDIX B 
Probability of an Arrival 


In this appendix, we derive an expression for Pr {A(t) > Ola}. We 
assume that the interarrival times are independent and identically 
distributed according to the distribution function F. We further assume 
that F is a mixture of exponentials, with the parameters chosen so 
that F approximates the interarrival distribution of an overflow pro- 
cess with load a and peakedness z (see Ref. 8); that is, 


F(t) = 1 — kyew™* — kee7"*. 


Now, Pr {A(¢) > 0|a} is Just the probability that the first arrival 
after a random entry point (i.e., a stationary point for the arrival 
process) will occur before ¢ units of time have elapsed. Thus, from Ref. 
9, we have 


Pr {A() > Ola} =@ f [1 — F@)Ide 


—— a(B+2 — Wi at — tent), 


Ty, T2 T1 T2 
Since fo* [1 — F(x) ]dx = 1/a, it follows that 


Pr {A (Zé) > Ola} =l-ea (2 enrit Ht ent) 
Ty i) 


where the parameters k; and r; are functions of a and z as specified in 


Ref. 8. 
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Analysis of Toll Switching Networks 


By R. S. KRUPP 


(Manuscript received December 26, 1975) 


Two techniques are introduced for extending C. Y. Lee’s method of 
switching network analysis to cases of toll-type networks. The methods 
avoid certain inconsistent independence assumptions which would other- 
wise be a source of inaccuracies. One method partitions the Lee graph in a 
special way, while the other uses a lemma that characterizes the generating 
function of an average network property. Examples are worked for three- 
stage networks and a model of the No. 4 ESS. 


I. INTRODUCTION 


In a well-known 1955 article) C. Y. Lee introduced simplified 
methods for the analysis of switching network characteristics, such 
as blocking probability. Using a probability linear-graph (hereafter 
called Lee graph) to represent the network and an assumption of 
independent link occupancies, he described ways to quickly obtain 
approximate expressions in many cases of interest. As an example of 
possible inaccuracy, Lee pointed out a three-stage network that is 
known to be nonblocking but is assigned a nonzero blocking prob- 
ability by his method. 

The present work introduces two different techniques for extending 
Lee’s method to avoid certain inconsistent independence assumptions, 
which are the source of the inaccuracies he noted. The extended 
methods will, for instance, reproduce M. Karnaugh’s more accurate 
expression? for blocking probability of a three-stage network, but with 
less mathematical labor. When applied to a “generalized” three- 
stage network that can model the No. 4 uss,’ the techniques yield 
formulas that greatly simplify expressions currently in use. The 
appendix lists a computer routine to calculate blocking for the case of 
generalized three-stage networks. 


il. FIRST METHOD 
2.1 Generalized three-stage switching network 


Consider first a ‘‘generalized” three-stage switching network, as 
indicated schematically in Fig. 1. The solid circles in the first and last 
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Fig. 1—Generalized three-stage switching network. 


stages denote ordinary nonblocking switches, of sizes N X M and 
M XL, respectively, such as crosspoint arrays or time-slot inter- 
changers. The open circles in the middle stage, however, stand for 
switches that could block. Such a switch might, at a given time, have 
only a probability, rather than a certainty, of being able to connect a 
given pair of A- and B-links incident on it. This could model addi- 
tional stages of switching network, such as the time-shared space- 
division stages in the No. 4 ess.’ An independent blocking probability 
@ is assigned to each middle-stage switch. We use the notation 
Q = 1 — Q to indicate the corresponding transmission probability. 
The overbar denotes the probabilistic complement in all formulas 
that follow. 

The Lee! graph in Fig. 2 shows all paths of the three-stage network 
that might be used to connect one call between a specified pair of 
terminations. Besides that pair, there are H = N — 1 other input 
terminations at the first-stage switch and F = LZ — 1 other output 
terminations at the last-stage switch. If we assign occupancy prob- 
abilities P and R, respectively, to input and output terminations 
other than the designated pair, the A- and B-links will have average 
occupancies Py) = PE/M and Ry = RF/M. 

If the first stage is a concentrator with # > M, it would be mathe- 
matically inconsistent to assume that the termination occupancies P 
were independent. In fact, that would imply a nonzero probability 
P¥ of handling F calls on only M links. It would not be inconsistent 
to assume a priort that the A-link occupancies Py were independent 
when # 2 M. A similar discussion applies to the last stage for the 
case F > M. Assuming that all probabilities Po, Q, and Ro are inde- 
pendent, the transmission probability for any single path becomes a 
product P,.QRo of transmission probabilities for each of its portions. 


Fig. 2—Lee graph of generalized three-stage network. 
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Then the blocking probability for the network is just the product of 
blocking probabilities for each of the 47 independent paths. 


a= Ol POR) = POR, . (1) 


If there is expansion in the first stage, with M > E, it would be 
mathematically inconsistent to assume that the link occupancies Po 
were independent. Indeed, that would imply a nonzero probability 
P# of busying all MZ links with at most F calls. It would not be in- 
consistent to assume that the termination occupancies P were inde- 
pendent when JJ 2 LZ. A similar discussion applies to the last stage 
for the case Af > F. In the boundary case EH = M, assuming random 
connections through the switch, we find that each kind of independence 
assumption (link or termination) implies the other with P = Po, and 
neither is inconsistent. Whether either assumption agrees with ob- 
served behavior of traffic is a difficult question that will not be ex- 
plored here. 


2.2 Toll-neutral case 


To shorten our terminology, we will name the case # > M “local,” 
the case H < M “toll,” and the case # = M “neutral.’”’ To emphasize 
the distinction, we cite the celebrated ‘‘Clos-type’” network. Clos4 
showed that a pure three-stage network (the case Q = 0) will be non- 
blocking when M > E + F. But (1) can vanish only when Py and Ry 
do. The contradiction arises from using the link independence assump- 
tions in a toll case. The local case will not be considered in this study, 

No matter what assumptions are appropriate, the event of having K 
busy A-links is the same as that of K busy input terminations, whence 
they have equal probability. In the toll case this is 


ra= (ic) PEG - PES, (2) 
since there are (#) ways to choose K busies plus # — K idles among E 
terminations, and each arrangement occurs with probability 
PX(1 — P)#-*X, when termination occupancies P are independent. 
Assuming that all probabilities P, Q, and Ro are independent, trans- 
mission probability for any single path containing an idle A-link is 
QRo, and blocking probability for all M— K idles becomes 
(1 — QR,)*-*. Multiplying by the probability of M — K idles (2) 
and summing over all0 S$ K S E yield . 


m= és PK(1 — P)?-E(1 — ORy)-*, (3) 


the network blocking probability for the “‘toll-neutral”’ case HS M =F, 
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Fig. 3—Graph of toll-neutral network. 


as opposed to the “neutral-neutral” case E = M = F given by (1), 
in the form 


ms = (1 — POR)™ = POR". (1a) 


We have set Ro = R since F = M and Py = FP since FL = M. 
By binomial theorem, the sum (8) is 


me = (1 — QRo)*-=(1 pone PQR,)®, (4) 


but this may be derived by a more direct route. To see this, we note 
that there is a minimum of J = M — E idle A-links, no matter what 
the status of the # input terminations. Let us set aside J such idle 
A-links in Fig. 3, denoting them by dashed lines. This partitions Fig. 2 
into two parallel graphs, the upper one with J independent paths and 
all A-links idle, and the lower one with F independent paths. The 
lower graph is just the neutral case so that, mirabile dictu, its A-links 
have independent occupancies P if its input terminations do. Thus, 
blocking probability for the lower graph is given by (1), with Po re- 
placed by P and M by #. Blocking for the upper graph is just 
(1 — QR,)!, as noted before (3). Network blocking (4) is then the 
product of these two terms. 


2.3 Toll-toll case 
There remains only the “toll-toll” case H S M = F, with the as- 
sumption of independent occupancies P and # for input and output 


terminations. The argument leading to (2) may be repeated to yield 
the probability 


ae (zl _ PX(1 — P)®-¥RY(1 — R)F-¥ (5) 


of having X busy A-links and Y busy B-links. 

The pure three-stage switch (Q=0) blocks only if none of the 44 —X 
idle A-links matches any of the M — Y idle B-links. There are (}) 
ways to arrange the idle B-links, but only (;,;% ») of these match all 
M — Y idles to the X busy A-links. Thus, assuming all arrangements 
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are equally probable, the mismatch blocking probability becomes 


X M 
= = ly! ! — Aq)! 
TXY Gye wee) X'V!/MV(X¥ + Y— dM)! ~~ (6) 
Multiplying (5) by (6) and summing over0 =< X S$ HandOSYSF 
yields the overall network blocking probability 


TR DS TwaB(X, Y)rxy. (7) 
X,Y 


Karnaugh? has performed this sum, using binomial theorem first 
on the X-sum to obtain 


= (ah eB) REG )errara 


and then on the Y-sum to yield the blocking 


_ EMF! 
7B” MiE+F—M™)! 





PM-F RM-E ( =p R) E+F—M (9) 


in closed form. Now zz in (9) appears to be the product of a ‘“‘com- 
binatorial” factor (4% -)/G@‘) and a “probabilistic” one. A direct 
derivation will help to bring out their origins. 

Again, there are at least J = M — E idle A-links and J = M—F 
idle B-links. We set these aside in the Lee graph in Fig. 4, denoting 
them by dashed lines, as before. No matching of dashed A- and B-links 
is shown in the figure, since this could not correspond to a blocked state 
of the network. There are £ solid A-links, corresponding to the neutral 
case, so that we can assume independent occupancies Py = P for them. 
Similarly, the F solid B-links will have independent occupancies R. 

Figure 4 clearly partitions Fig. 2 into three parallel graphs. The top 
graph has J independent paths and its blocking is obviously R?. The 
bottom graph has J independent paths, and its blocking is P’. The 
middle graph has H = MW —IJ—J=H+F —M _ independent 
paths, cach with transmission probability PR, so that its blocking is 





Fig. 4—Graph of toll-toll three-stage network. 
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(1 — PR)*-'-/, The product of these three blocking probabilities, 


rp = PIRIPRM-1-5, (10) 


is the blocking for the network configuration in Fig. 4 and is also the 
“probabilistic” portion of wz in (9). 

There are (3/) ways to arrange the dashed B-links, but only (“7 7) 
of these match all J dashed B-links to solid A-links. If all arrangements 
are equally likely, the quotient (3)/(4‘) is the probability of the 
“blockable”’ configuration in Fig. 4. This accounts for the ‘‘combina- 
torial” portion of zs in (9), which is in fact the blocking at full oc- 
cupancy P = R = 1. Note that, if M > H+ F, then (ZF + F — M)! 
is infinite and rz vanishes in (9), consistent with Clos’ result. 


2.4 Generalized toll-toll case 


Even if Z of the AJ — X idle A-links match Z of the M — Y idle 
B-links in (5), a generalized three-stage network may still block, with 
probability Q7, for P, Q, and R independent. There are (#/) ways to 
arrange the M — Y idle B-links, but only (¥ 7*)(,-}-z) ways to 
match just Z of them to M — X idle A-links and the rest to the X 
busy A-links. For equally likely arrangements, then, the probability 
of a Z-match becomes 


= (U5 aw -¥—2)/(¥): 


The overall network blocking probability now is the sum over X, Y, 
and Z of the product of (5), (11), and Q?, 


TR >= > Q2raBnlX, Y)rz(X, i) (12) 
X,Y,Z 


As an example,? Mf = 128 and L = N = 105 in the No. 4 Ess so that 
E =F = 104 and J = J = 24. This makes wz in (12) the sum of 
302,845 nonzero terms. 

As discussed previously, a more direct derivation of blocking 
probability sz may be prosecuted for the generalized three-stage 
network. We set aside J idle A-links and J idle B-links as dashed lines 
in the Lee graph, Fig. 5. This time there may be some matching of V 
dashed A- and B-links, forO0 S$ V SI, J. Figure 5 partitions Fig. 2 
into four parallel graphs with J —V, V, J-—V, and H+V=M 
—I-—J+V independent paths, respectively, to yield blocking 
probabilities of (1 — QR)”, Q’, (1 — PQ)’, and (1 — PQR)#+Y 
as discussed earlier. 

There are (§‘) ways to arrange the J dashed B-links but only 
(+)(, 2) ways to match just V of them to the J dashed A-links and 
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Fig. 5—Graph of toll-toll generalized three-stage network. 


the rest to the # solid A-links. Thus, the probability of a V-match is 


if all arrangements have equal likelihood. This is multiplied by the 
blocking probability for a network with V matches, as in Fig. 5, and 
summed over V to yield the overall network blocking probability 


TB = ey ayQ’ (1 — PQ)I-"(1 — QR)" (1 — PQR)#*". (14) 


Ili. SECOND METHOD 


For the example of the No. 4 Ess, 7g in (14) is the sum of 25 non- 
zero terms. It does not appear to be possible to perform the V-sum 
and reduce (14) to a single term; however, we shall see that +s does 
have a simple one-term generating function, as well as one-term 
operator-product expressions. 


3.1 The generating function 


It is possible to perform the X-, Y-, and Z-sums in (12), and thus 
reduce zz to the simpler form (14). While tedious, this exercise is also 
highly instructive. Writing out (5) and (11) at length in (12) yields 


(M —X)! ,,(M—Y)! 





ee XOZRY F-Y 
TB rn Q R (E cae X)! Qa (F = Y)! B WXYZ) (15) 
where a = P, and 6 = R and the last factor is 
= EIF\/ M12! 
ee Me RE ZO iY DIE SY a A) 
_ EFI(M\(M—Z x | i 
~ MPE\Z xX M—Y-Z 
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Substituting the following identities into (15), 
(lq — X)! 


(fx! QE-X = gM —EgM—-X — glyM—-Xx (17) 
— ! 
Fe ai BF-¥ = Oe Be = aZ6"-Y, (18) 


where de and 0g are the a- and 6-derivatives, yields 


FI — 
y XZ 





Bre ey (20) 


We should note that, formally, a and 6 are independent variables, 

which are set equal to P and # only after all differentiations are per- 
formed. Thus, d« and dg do not act on P and R. 
By binomial theorem, the sum in (20) is just 
= B2ZRM-X—-Z(8 + R)*, 


which reduces (19) to 


as =i" ato5 E 078*( 7 7) Sx (21) 
Se = ( oo ) PX ql-XRM-X-2(g + RX, (22) 


Similarly, the sum in (22) may be performed to get Sy = a?y™—4 for 
y = ak + BP + PR and reduce (21) to 
_ HIF! M 

7B = TTR 020% y ee a7 84Q2y* 4, (23) 
The sum in (28) is just (a8Q + 7)”, by a third application of bi- 
nomial theorem, whence it becomes 

ee 2! ee 
™ = 7p 030% (a8Q + ak + BP + PR)™ 


= 7) al(aQ +P) (a8Q + 7)". (24) 


The forms (24) are about the closest we can get zz to a simple 
closed expression. Leibniz’ rule for the derivatives of a product will 
give (24) the form 


™ = B myQ" (aQ + P)7-’ (BQ + B)IH’ (aBQ + y)"*", (25) 
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with zy as in (13). Substituting a = P ands = R will now reduce (25) 
to (14), since, for example, 


aQ+P=P(1—Q+1-P=1-—PQ (26) 
aBQ + y = B(aQ + P) + R(a + P) 
= RQ — PQ) +1—R =1— PQR. (27) 


What is particularly illuminating in the preceding calculation is 
step (24). It says that ra(J, J), the blocking probability for a network 
with J excess A-links and J excess B-links, is a differential coefficient 
of “something.’”’ To make this idea more precise, we construct the 
corresponding generating function G. Multiplying (24) by (#4) U7 G) W7 
and then summing over all 0 S$ J, J S M yield the form 


on (F)o(F) wate 


U' Wl 
= » Ti Tr 020% (aBQ + aR + Br + Pk)”. (28) 


By Taylor’s theorem, the second line is just 


G=(U+P)W+RQ+(U+P)R+(W+ RP + PRIM 
CU+)W+1)-(+P)\W+ RQ)! 


+i +y¥|1- (1 T)(t- 9 )a]". 
(2 


~ O+F1 W+i 


Thus, rs (J, J) can be obtained by expanding (29) in powers of U and 
W, then inspecting the coefficient of U'7W/. In practice, we get the 
coefficient by differentiating J times in U and J times in W, then 
setting U = W = 0. But this is exactly equivalent to the steps leading 
from (24) to (14). 


3.2 Fundamental lemma 


In the preceding section, it was shown that blocking probability wz 
could be obtained quickly and directly from a generating function G, 
rather than through an argument involving a four-way partition of 
the Lee graph. If we could now obtain G directly, without steps (5), 
(11), (12), and (15) to (24), a great deal of effort might be saved. The 
structure of G is indeed determined by the following lemma, which has 
surprisingly little to do with switching networks. 

We idealize the network as a “‘black box,’ as in Fig. 6, on which a 
certain set of Mf ‘‘trunks’’ are terminated. These are assumed to have 
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hs 
P 
Fig. 6—Network idealized as black box. 


independent occupancy probabilities P, except that some number 
I s M of the trunks are “dead” or disconnected, with zero occupancies. 
Associate with the black box some quantity A(K), a function of the 
number K of busy terminations among the H = M —T “active” 
trunks. By using (2), we find the average value of A to be 


I 


(LP)=¥ ( se ) PK(1 — P)¥-I-KA(K), (30) 


BR K 
We have assumed that the M trunks are interchangeable, in the sense 
that A, and hence z, does not depend on which I of the M are dead. 


Fundamental Lemma: The generating function for +, P) ts written 
in terms of (0, P) as 


g=x(7T)umGy=U+y%*(o754) GD 


Formal Proof: Substitute (80) into (81) and use binomial theorem to 
perform the sum over0 SJ SM. 


Informal Proof: Suppose that each trunk has an independent proba- 
bility \ of being active, and hence \ = 1 — d of being dead. Then the 
average value of z is just 


e=-5 (7 ) MM-I(1 — d)Fa(I, P). (32) 


On the other hand, each trunk carries an average load of AP erlangs, 
independent of the others, so that # = 7(0,\P). We now define 
U = X/X, which yields \ = 1/(U + 1), and set G = (U+ 1)”# to 
transform (32) into (31). The model of “dead trunks” is invoked only 
to validate the mathematical relation (81), of course, and need not 
accord with observed behavior. 

An obvious application of the lemma is to let the black box be a 
switching network and z be its blocking probability zz for some pair 
of terminations. As an example, consider the “toll-neutral’ case of 
the generalized three-stage network, with EF < M =F. Let the 
trunks be terminated on the same first-stage switch as the input 
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termination of the pair whose blocking is sought. Then J = 0 corre- 
sponds to the neutral case H = M, for which (1) is valid with Py = P. 
Now the generating function becomes 
G=(U+1™ ae a 
= (U + 1) [1 — (1 ~ pq) 2% 
=(U+1-— (U+ P)QRo]". (33) 
Differentiating J times at U = 0 yields the blocking 


rs(L, P) = 08/1! Ge 
= (1 — QR) (1 — PQR)*-, (34) 


which is the same formula as (4). A bare minimum of knowledge about 
the network structure, just formula (1), was thus sufficient to deter- 
mine blocking probability. 

The ‘‘toll-toll” case, H S$ M =F of the generalized three-stage 
network, requires an iterated form 


G=r ) ul ( ) W'a(I, P; J, R) 
rr\t mf 
= M I . ft 
= ee ) U's (1, Pi, 7) (W + 1) 
7 Se eo (35) 
7U+1’ °W+1 
of the lemma. For » = M trunks terminated on the same last-stage 
switch as the output termination of the pair whose blocking is sought, 


we see that J = J = Ois the neutral case H = M = F, with wz given 
by (la). Thus, substituting (la) into (35) yields a generating function 


owen sols (— ya) wi) 
( 


which is the same as (29). The blocking becomes 


Ta = OY0WG/T! ) J! (‘7 ) = dyg/J! (a) (37) 


9 = Wey -(1- wig) a] 
[3 — Pp (1 = wr) of (38) 
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and applying Leibniz’ rule for the derivatives at W = 0 again yields 
(14). 

Another approach is to observe that we have already “differentiated 
off” I terminations in the “toll-neutral”’ case. Applying the lemma 
once more to (34), we can construct the generating function g in (88) 
at once and then “remove” the J spare output terminations as in (37) 
to obtain blocking probability (14). Again, only (1) was needed to 
specify the particular network under consideration. 


3.3 Operator formulation 


Lemma (31) may be “solved” for r(UZ, P) by making a formal 
expansion of its right side in terms of 0, the P-derivative. Observing 
that 


erepe = x ae) ee -5 (any ps = (@Py (39) 


and hence A? *f(P) = f(AP), the generating function is 


M — Pa 


G = (U + 1)*-P 8g (0, P) = = : 


) Utr(0, P). (40) 


Equating powers of U in (31) and (40) yields the identity 


rPy=(" 77?) ror) /(7), (41) 


which we write out as an operator product 


AE Ps (1 2 wor) ( = wo)( és at) 7(0, P) 


7 (1 ~ were) r(I — 1, P). (42) 
This has a simple and natural interpretation: ‘‘deloading operator’’ 
1 — Pd/E serves to “remove” one of the # remaining terminations 
represented in the expression +(M — E, P) for any average network 
property 7. 

Of course, the validity of the mathematical manipulations above 
must still be demonstrated. This hinges upon establishing convergence 
of the operator expansion in (40), and hence the sum in J. But we note 
that each factor of the form » — Pod will annihilate the corresponding 
power P” in 7(0, P). Thus, if z is a polynomial of degree M at most, 
as in (80), it is annihilated completely in all terms of (40) for which 
I exceeds M, and the expansion indeed converges, since the sum is 
finite. In practical calculations of blocking probabilities or other 
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formulas, the operator formulation (42) may not be as convenient to 
use as direct differentiation of the generating function. 


IV. SUMMARY AND DISCUSSION 


Two techniques are introduced to extend C. Y. Lee’s method to 
switching networks of ‘‘toll’” types defined in the text. Basically, these 
have some expansion in the first stage or concentration in the last 
stage, or both. The link independence assumptions of Lee’s method 
are inconsistent in such cases, which causes some inaccuracy. 

The first of the two extension techniques partitions the Lee graph 
into two or more smaller graphs, so that the independence assumptions 
will be consistent within various portions. This is done by setting 
apart the proper number of links that are known to be idle. The first 
technique is applied to examples of three-stage networks, yielding a 
result of Karnaugh’s and a simplification of the expression used for 
computing blocking for the No. 4 uss. To focus attention on the 
methods, all examples worked out are blocking probabilities, but other 
network averages may be treated similarly. 

The second, and more general, technique makes use of a lemma that 
characterizes the generating function of an average property for a 
whole family of networks. In a sense, this technique is a counterpart 
of the first, since it operates by attaching a sufficient number of idle 
“phantom” terminations to the network to make the link independence 
assumptions valid, instead of setting aside idle links. Then the gener- 
ating function can be constructed from the resulting ‘‘neutral” case, 
and the lemma, or an equivalent operator formulation, allows removal 
of the excess terminations. Two of the three-stage network examples 
are reworked to make a comparison of the two techniques. 
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APPENDIX 


The ForTRAN subroutine listed in Table I calculates the blocking 
probability zg given by (14) for the ‘‘toll-toll”’ case of the generalized 
three-stage switching network. Cancelling among the nine factorials 
in (18) yields an efficient calculation and minimizes the chance of 
underflow or overflow. The routine is accessed by: 


cALL BLOCK (J, P, J, R, M, Q, PI) 
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Table |— Blocking probability subroutine 


SUBROUTINE BLOCK (I, P, J, R, M, Q, PI) 
Ketel J 

L = MAX0(0,I + J — M) 

PQ = 1.0 — (1.0 — P)*(1.0 — Q) 

RQ = 1.0 — (1.0 — R)*(1.0 — Q) 

SQ = 1.0 — (1.0 — R)*(1.0 — Q)*(1.0 — P) 
A=1.0 

IF(L.GT.0) A = Q**L 

A = A*PQ**(J — L)*RQ**(I — L)*SQ**(K + L) 


B=A 

C = Q*8Q/(PQ*RQ) 
D = 1.0 

LOW =L+1 


LIM = MINO(I, J) 
IF(LIM — L) 30, 20, 10 
10 DO15N = LOW, LIM 
A = A*C*FLOAT((I — N + 1)*(J — N + 1))/FLOAT((K + N)*N) 
B=B+A 
15 D = D*FLOAT(K +L + N)/FLOAT(M + N — LIM) 
20 PI = B*D 
30 RETURN 
END 


and, referring to Fig. 1, the arguments are: 


Il=M—#H=M —N+1 = minimum number of idle A-links 
J=M—F =M—2L+1 minimum number of idle B-links 
M number of center-stage switches 

P average occupancy probability of input terminations 

Q blocking probability of center-stage switches 

FR average occupancy probability of output terminations 

PI returns the calculated value of zz. 


Independence is assumed for all P, Q, and R. 
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Test Facility for a Message-Switching System 


By W. E. BRACKER and E. R. SEARS 
(Manuscript received October 8, 1975) 


This paper describes the development and use of a test facility for a 
computer-based store-and-forward message-switching system. The test 
facility provides a close approximation to actual operating environments, 
allowing the system to be tested in a realistic environment. The facility 
provides a vehicle for many essential system studies including performance 
monitoring and measurement, load testing, certification, and validation of 
various simulation and forecasting models. A wide range of tools is used 
in this faciltty—hardware and software monitors, terminal and network 
simulators, and data reduction and analysis packages. One version of the 
system is used to drive and thus test another version. A language for 
creating a set of test data 1s defined and is used to present various param- 
eters to the switching system. 


l. INTRODUCTION 


A comprehensive test environment is important in the development 
and maintenance of any large computer system. In particular, when 
dealing with a message-switching system, special attention must be 
given to message loading, traffic mixes, and network configuration 
variables. Two methods of controlling these variables to test the system 
are: 


(2) The use of real terminals and operators to drive the system, or 
(2) Simulation of terminal and operator system interactions to 
exercise the system as though it were in live operation. 


The second method was used in our approach because it leads to more 
comprehensive, more controlled, and easily repeatable tests. 

We describe the concept of ‘‘using the system to test the system’’— 
a test facility is created from an existing system configuration. The 
system described is B1scom,* a large-scale, computer-based, store-and- 
forward message switcher. 


* Business Information Systems Communication System 


857 


Early in the development of Biscom, the need for a comprehensive 
facility was recognized. The important factor in the creation of the 
facility was the need for a controlled test environment—one in which 
the operation of B1iscom could be observed and measured. 


Il. BISCOM OVERVIEW 


Figure 1 illustrates a typical Biscom installation. For reliability 
reasons, BISCOM is configured as a dual system with two central 
processing units (cpus), each cpu having a complete set of peripheral 
units. One system serves as the Active (primary) message switcher 
while the other system acts as the Recover (backup) system in case 
of a failure in the Active system. This division is purely logical; either 
of the two systems may be the primary or recovery system (the com- 
munication network interfaces are attached to the machine serving 
as the active message switcher). BIscom is designed to operate on a 
Xerox* Sigma-5 computer with core memory sizes up to 128,000, 32-bit 
words. 

During Active system operation, all significant events are logged 
as follows: 


Message Journals 


(z) All User Input Messages 
(zz) All System Internally Generated Messages 
(721) Messages Fetched or Delivered From System Data Files. 


Logging Functions 


(z) Input Start of Message 

(77) Input End of Message 
(t2z) Output Start of Message 

(zv) Output End of Message 

(v) On-line Network Changes 

(vi) Disk File Transaction Data 
(viz) Periodic Terminal Handler Statistics 
(vitt) Disk Retry and Parity Error Statistics. 


The complete text of all switched messages are journalized (recorded) 
on magnetic tape. These tapes are input to a set of programs that keep 
the Recover system up to date with the Active system (e.g., they record 
the current state of the data base, network configuration, undelivered 
messages, etc.). 

In the event of an Active system hardware or software failure, the 
Recover system becomes the Active system. At the present time this 


* Xerox is a trademark of the Xerox Corporation. 
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Fig. 1—niscom dual system. 


process is manual, although testing of an automatic switch-over in 
case of failure is underway. In future versions of Biscom, logs and 
message journals will be sent directly from the active to the backup 
system over a channel-interface unit. 

BISCOM uses access arrangements including data-link control pro- 
cedures that agree with those contained in Standard X3.28 (1971) 
published by the American National Standards Institute. These access 
arrangements include low and medium speeds, asynchronous and 
synchronous, respectively; switched and dedicated access lines; and 
remote computer access at speeds up to 50 kilobits/second. 

Messages to be switched by Biscom consist of a header and text; 
message framing is accomplished by three control characters: start 
of header (SOH), start of text (STX), and end of text (HTX). The 
system is text transparent in that only header information is scanned 
and validated, although illegal characters in the text are recognized. 
Information in the header specifies the actions B1scom is to take with 
respect to the message. If the header is invalid or incomplete, delivery 


Table |— Message mix and traffic loads for a typical busy hour 
Hise Deliveries? Aaa 
Type’ Ms Hous Length’ Mac, Meu 
cee ioe | ante: || Gi 
POSM 125 100 1 125 
SMST 1200 425 0 
SMDL 200 500 1 200 
SMFT 3000 25 3000 
SMRD 700 440 1 700 
SMRD$ 500 440 3 1500 
SMRD 100 440 3 300 
SMRD 400 700 1 400 
SMRD 600 700 1 600 
SMRD 200 440 2 400 
SMSD 1240 440 2 1 3720 
SMSD 320 440 4 1280 
SMSDI!! 30 600 7 210 
SMCL 125 25 0 
SMRP# 625 425 0 
Total Input 9365 Total Output 12,435 


: See Appendix A (POSM = straight switched). 
Byt 


es. 
t Tow 110 baud, 150 baud terminals; med—2400 baud terminals. 
§ 500 messages in the SMF file will be replaced with 440-byte messages. The new 
messages are delivered to three medium-speed terminals. 
1! 30 messages of 600 bytes each are stored in the SMF file and delivered to seven 
low-speed terminals. 


# 625 messages of 425 bytes each will replace an equal number of messages already 
in the SMF file. 
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will not be attempted and the originating station is notified. If the 
header is acceptable, the message is switched to the designated loca- 
tion(s) and/or stored in one of the system data files. Appendix A 
provides a description of the message syntax accepted by BISCOM. 

A BISCOM message mix is determined by the types of messages input 
during a given hour, while message load is determined by the number 
of input and output messages processed during this period. Table I 
shows a typical busy-hour message mix and traffic load for a sample 
BISCOM installation. The Biscom busy hour is defined as a fixed per- 
centage of the theoretical upper limit of input and output messages 
that can be processed during a selected peak 60-minute period. 


Ill. TEST FACILITY OVERVIEW 


The test facility is comprised of three major components: terminal 
and network simulators, hardware monitors, and software monitors. 
The logical configuration is shown in Fig. 2. 

The terminal and network simulators drive the Active system; a 
set of real communication lines in the Active system is attached to 
simulated terminals in the Driver. In conjunction with a test message 
format handler in the Driver, the simulators respond to polling and 
selection sequences from BiscoM, and transmit and receive traffic. 

Hardware monitors are attached to the Active system to monitor 
physical resource utilization through test points that include the cpu 
hardware and peripheral subsystem. Output from the monitors is 
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MONITORS 


ACTIVE SYSTEM 
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HARDWARE 
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NETWORK 
HARDWARE REAL TERMINAL 
MONITORS NETWORK 


Fig. 2—The test facility. 
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stored on magnetic tape and forms input to a set of data-reduction 
routines. 

The software monitors, which are incorporated in the code of both 
the Active system and the Driver, serve to measure system physical 
and logical resource utilization. The Active system outputs statistics 
every three minutes to a Reports terminal, and also logs significant 
system events onto magnetic tape; the Driver periodically outputs 
communication line profiles to a line printer. 


IV. TEST FACILITY DETAILS 
To examine the test facility in detail, we discuss 


(¢) Terminal, network, and traffic simulations. 
(22) Test messages. 
(217) Measurement tools. 


4.1 Terminal, network, and traffic simulation 


To create the Driver portion of the test environment, the normal 
recovery system is replaced with load Driver software, as shown in 
Fig. 3. The Active and Driver computers are connected by trans- 
mission lines that terminate in low- and medium-speed data sets. The 
network and corresponding communication lines are defined in both 
the Active and Driver systems by a network-generation program. This 
definition consists of core memory tables that characterize the lines 
in terms of speed, character code, and access arrangement. 

The Active network consists of a set of lines terminating in real 
terminals and simulated terminals; the Driver consists of a set of 
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COM 1/0 PROC 
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Fig. 3—Example of the active and load-driver configuration. 
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real terminals and terminal simulators. Active simulated terminals 
are connected to the Driver via low- and medium-speed data-set con- 
nections. During the course of a test run, traffic is received from a 
subset of driver terminals and directed to primary network real and 
simulated terminals and the various Biscom data files. 

To the Active system, the simulated terminals appear real—no soft- 
ware changes are made in the Active system to distinguish between 
the simulated and real portions of the network. 

The Driver software represents a slightly modified Biscom. Existing 
BISCOM software is used to perform support functions, such as message 
queuing, and special software is added that communicates with the 
primary system to generate a flow of traffic. This special software 
consists of two terminal simulators and a test-message format handler. 
One of the simulators handles medium-speed (2400-4800 bits/second, 
synchronous) communication lines; the other simulator handles low- 
speed (110 bits/second, asynchronous) communication lines. A Format 
Handler program intercepts test traffic queued to the Driver simulators 
and restructures the traffic for transfer to the Active system. 

As far as the Driver software is concerned, simulators are identical 
to normal terminals. It is this concept which permits the BIscom system 
itself to be employed as the Driver. When activated, a simulator makes 
the decision, when polled, whether to send a message, and, when 
selected, whether to receive a message. The simulators may also re- 
spond negatively to polls or selections for a predetermined length of 
time. 

Driver test traffic input may come from one of two sources: (2) real 
terminals defined in the Driver network or (iz) a tape file. Messages 
that have been created by an off-line process and stored on the tape 
file may be sent by the Driver to the Active system and switched as 
if they were coming from actual terminals; of course, such messages 
must conform to valid Biscom syntax. Prior to a test run, a tape is 
generated representing a specific load test; for example, it may be 
desirable to perform a 1-hour test representing a load of 40 percent 
of an hour’s traffic mix for a particular installation. The necessary 
parameters are input to a PL/1 program (SIMGEN) which generates 
an appropriate test tape (see Fig. 4). Currently, a library of test tapes 
is maintained, representing many different loads, message mixes, and 
test durations. 


4.2 Test messages 


Message-load and message-mix data is jointly forecasted by a Bell 
System operating telephone company and Bell Laboratories. This data 
determines specific message inputs that are made available to the 
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Fig. 4—Process of creating test-message tape. 
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driver system. These inputs are associated with the simulated ter- 
minals, and the driver system sends these messages to BISCOM at a 
controlled rate. Conversely, the messages sent to the primary system 
will stimulate activity and responses (i.e., deliveries of messages to 
simulated and real terminals) ; therefore, the rate at which the driver 
accepts messages from BIscom must also be controlled. By establishing 
this control on the transmission and receipt of messages associated 
with the primary system, a load capability is created. 


4.2.1 Test message format 


A BISCOM message has the format 
(SOH) Header (STX) Text (ETX). 


This syntax does not change in the load driver although message 
processing is modified by the terminal simulators and format handlers. 

The basic Biscom format is expanded within the text portion to form 
the syntax of the test message 


DH CS = 
60H) reapeR ST) sporp *SELs *SOH HEADER 
PT 
*REP r *STX THXT *PAD d ‘PT. 


The test message is broken into four major sections: DH (Driver 
Header), CS (Control Section), PH (Primary Header), and PT (Pri- 
mary Text). These message sections (7) direct test messages to driver- 
simulated terminals, (zz) control the responses of these terminals to 
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BIScoM polling and selection, and (777) define the text patterns sent 
when positive poll responses are desired. Examples of various test 
messages are given in Appendix B. 


4.2.2 DH (driver header) section 


This section designates a simulated terminal in the load-driver net- 
work. Normal Biscom mechanisms are used to queue the test message 
to a simulated terminal; however, instead of its normal delivery of the 
message to a real terminal, Biscom releases all control over the message 
and activates the appropriate terminal simulator. The simulator, as 
part of its operation, will call the test-message format handler which 
reconstructs the message for delivery to the primary system. 


4.2.3 CS (control section) 


The test-message control section controls the response of the simu- 
lated terminals to poll and select sequences from the primary system. 
By regulating the time between positive polling or selection responses, 
it is possible to present any desired message-transfer rate to the primary 
system. 

The *POL p and *SEL s subsections specify that a delay is to be in- 
troduced before either a positive polling response or a positive selection 
response is given. Polling and selection response delays stay in effect 
until reset by another test-message control section. 


4.2.4 PH (primary header) section 


Entries in this field may represent any valid Biscom header infor- 
mation, as shown in Appendix A. When a terminal simulator is ready 
to give a positive response to a primary polling sequence, the *SOH 
will be converted to the (SOH) control character. The header has no 
processing performed on it and must correspond to valid primary net- 
work terminals and/or data-base operations. 


4.2.5 PT (primary text) section 


This section defines the text portion of the test message. As in the 
case of *SOH, the test message format handler converts the *STX to 
the (STX) control character and performs other operations on the 
TEXT subsection if specified by the *REP subsection. *REP r operates 
on the PT section and on the PH section simultaneously, r specifying 
how many times this particular message is to be repeated before the 
simulator releases it and goes to the next test message. The *REP 
subsection is also used in conjunction with a special character ‘#’, 
which is used for sequence numbering. 
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The TEXT section may be alphanumeric strings of any length. 
*PAD d specifies how many repetitions of a 20-character ‘‘filler’’ se- 
quence are to follow the TEXT subsection. In this manner, messages 
of varying lengths may be transferred to the Active system without 
the need to supply all the characters in the test message. 


4.3 Measurement tools 


The measurement tools available in the Biscom test facility may 
be grouped into three categories: 


(t) Hardware monitors 
(it) Software monitors 
(zit) Logging analysis programs. 


The use of these tools along with their advantages and shortcomings 
are discussed in the next section. 


4.3.1 Hardware monitors 


COMRESS Corporation’s Dynaprobe® hardware monitors, models 
7900 and 8000, are currently used in the test facility. These monitors 
are used to collect data concerning system hardware component 
activities; for example, they measure the number of seeks, reads, and 
writes performed by the disk subsystem; the amount of disk controller 
and disk pack utilizations; the amount of Central Processor utiliza- 
tion; and the amount of magnetic tape controller activity. The hard- 
ware monitors also have the capability of monitoring instruction- 
execution activity as a function of instruction address, making possible 
the production of “maps” of instruction activity by memory location 
(see Fig. 5). This capability has been particularly useful in locating 
anomalous program loops and in selecting program modules as candi- 
dates for code optimization. 

A hardware monitor does not perturb the system being measured and 
thus does not bias the measurements made. In addition, hardware 
monitors have a certain amount of flexibility and provide a way to 
make measurements that are not readily attainable by other means, 
e.g., Measuring instantaneous cPv utilization, core usage, and interrupt 
enable/disable times. 

The data gathered by the hardware monitor is used in the validation 
of various simulation and forecasting models of Biscom. There are, 
however, many software-related measurements that cannot be ob- 
tained by this method. For example, it is difficult to measure buffer 
utilizations, message processing times, and task activations. Software 
monitors must be used to measure this activity. 
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Fig. 5—Typical reduced output of hardware monitor. 


4.3.2 Software monitors 


There is limited software monitoring capability built into the Active 
system and Driver. The Driver software contains a statistics-gathering 
routine that periodically prints statistics on the network interactions 
with the Active Biscom system. There is also an Active system soft- 
ware monitor routine that collects data on buffer pool utilizations, 
queue lengths for various processing tasks, and the number of mes- 
sages entering and leaving the system during specified time intervals. 

Significant system events are also logged on magnetic tape for use 
by the Recover system. In the test environment, these log tapes serve 
as Inputs to an analysis program. This analysis program outputs 
statistics on message volumes and switch time (Table II) ; future plans 
call for reporting data-base activity and network performance through 
expanded system logging. Such logging is part of normal BIscoMm 
operation, and the test facility makes use of already available data. 

Two of the major advantages of software monitors over hardware 
monitors are the wider range of system attributes that can be mea- 
sured and the greater flexibility in the control of data collection. One 
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Table Il— Typical log analysis output 


Average Messages 
Number of Average Average Delivered/ 


Message Type Messages Bas Chars In | Chars Out Cleared 


POSM* 5 


0.06 89.00 89.00 5 
SMST 68 0.57 422.29 0.00 0 
SMSD 64 1.89 424.87 424.87 152 
SMRP 25 0.72 415.40 0.00 0 
SMRD 103 1.32 530.04 530.04 146 
SMCL 3 0.82 25.00 0.00 3 
SMFT 129 0.52 25.00 516.25 129 
SMDL 6 0.87 28.00 563.66 6 


Window times: 02 :55.34-03 :00.34; window length: 5.00 minutes 
No. messages = 403; 99.3% load; 100% input-message load = 4860 msgs/hr 


* Straight switched messages. 


of the disadvantages is that a software monitor interacts with normal 
system operation and thus uses some of the system’s resources. Another 
disadvantage is that special data-reduction-and-analysis software must 
be developed for a software monitor. 

Note that one use made of the hardware monitor is to estimate the 
impact of the software monitor operation on normal system processing ; 
in particular, it is possible to determine how much crv time the soft- 
ware monitor is using. 


V. SOFTWARE TEST ACTIVITY AND SYSTEM CERTIFICATION 


The tools comprising the test facility are being employed for soft- 
ware test purposes as well as for performance measurement and evalua- 
tion studies. The most heavily used aspect of the test-facility tool is 
the load-test capability. Whereas a load test is performed on each re- 
lease of Bisco prior to site installation, a measurement-and-evaluation 
effort is only applied to selected releases of the system. There are two 
major reasons for this: 


(t) Data collection, reduction, and analysis is a costly process, 

(it) Software changes on a per-release basis usually do not warrant 
a complete measurement-and-evaluation effort. However, those 
releases that contain performance-affecting changes have full- 
scale measurement-and-evaluation procedures applied prior to 
site distribution. 


The load test is the culmination of prerelease test activity on a given 
set of software changes. Prior to the load test, the software changes 
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are tested by the change originator for their functional correctness. 
The changes are then integrated into the Biscom software, and the 
entire system is subjected to a standard test series. The final stage in 
the test procedure is the application of the load test. 

As discussed in Section I, the load test has essentially three parame- 
ters: message mix, message load, and network. The message mix and 
related network configuration cause the software to react in different 
ways depending upon the processing requirements on a specific message 
type (designated by the message-header information). A message that 
requires a store into the data base has different requirements than a 
message that is switched directly from terminal to terminal. Likewise, 
messages that access different data files in the Biscom system have 
different processing requirements. The message-load parameter allows 
an approximation to the message-processing load expected to be ex- 
perienced in an actual operating environment. This processing load 
is jointly forecasted by a Bell System operating telephone company 
and Bell Laboratories. Beyond these parameters, system dependent 
variables can be stressed in a manner designed to cause system actions 
that are not normally met in a simple test. Typical variables stressed 
are buffer exhaustion and buffer boundary conditions. 

The load test is the final test that is applied before release of the 
software product. Data analysis is performed and compared with the 
analysis of previously released systems in a search for any discrepancies 
that might point to a design flaw. This analysis also ensures that the 
system does not regress (from previously released systems) in load or 
functional capability. 

After the software system is released, the load test is again used to 
exercise the released system on a continuous basis in search of design 
flaws or dormant software bugs. Exercising the system on a continuous 
basis involves activating the system and switching messages for several 
consecutive hours. 


5.1 Test results 


Test results to date can be classified into two categories, namely, 


(2) Uncovering of software problems. 
(21) Certification of the system-load capacity. 


The first category of test results includes an entire range of software 
problems. A typical example in this category was uncovered during 
system shutdown, the time during which the system cleans up traffic 
in progress, secures its data base, and transmits “good night” messages 
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to the terminal users. The system performed this function adequately 
when not under load; however, when shutdown was attempted in a 
loaded environment, it did not function properly. This condition was 
corrected before it occurred in a real environment. 

Another example of category (2) is the saving of machine states when 
high-priority tasks preempt current system activity. The software 
associated with “state” saving and restoring operated correctly until 
stressed in a load-test environment. This condition was also cor- 
rected. Violations of reentrancy conditions in the data-management 
functions were also uncovered and corrected in the laboratory 
environment. 

We also found that the load-test facility is capable of exercising 
the various hardware components to such an extent that it can cause 
the hardware (disk units, tape units, etc.) to fail and thus force the 
system to exercise its retry logic. This is highly important, since many 
hardware conditions and status responses are very difficult to create 
during normal system operation. 

The second category of test results relates to the throughput or 
capacity of the system. The point has already been made that system 
capacity is dependent upon the three variables: message type and size, 
message load, and network configuration. The laboratory has a close 
approximation to these variables as they relate to several live instal- 
lations. In this environment, it has been possible to certify the cap- 
ability of the system to process future traffic loads for these installa- 
tions. This has been very useful information for site planners, partic- 
ularly as it affects plans for routing more traffic through the system. 
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APPENDIX A 
Message Syntax 
A.1 Terminal-to-Terminal (Straight Switch)* 
Messages may be sent between terminals, or to and from the message 
storage files. The general format of each message consists of heading 


and text sections framed by delimiters. The heading must be cor- 
rectly structured, whereas the text section is “‘transparent’’ to BISCOM. 


_* This message type docs not cause access of any system data files and is switched 
directly from an originating terminal to destination terminal or terminals. 
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(SOH) DAC1 DAC2 ... DACn (STX) TEXT (ETX), where: 


(1) DACi is any 1-8 alphanumeric terminal logical name. 

(2) The DACs must be separated by blanks. 

(3) The total number of characters between (SOH) and (ETX) 
must be less than 256. 

(4) The total number of characters between (SOH) and (ETX) 
must be less than 6400. 

(5) TEXT may be any string that does not include certain control 
characters [e.g., (SOH), (ETX), ete. ]. 


A.2 Special Message File (SMF) 


(SOH) SM**. DACI ... DACn/MSGID/PAGE/DATE CON- 
TROL MM-DD (STX) Text (ETX) 


where ** indicates one of the following process codes: 


ST—Store FT—Fetch 
SD—Store and Deliver DL—Deliver 
RP—Replace CL—Clear 


RD—Replace and Deliver 


MSGID (Message ID) 
Associates a storage and retrieval key with the message in the 
data base, any alphanumeric string from 1 to 15 characters. 


PAGE 
Any number between 1 and 99; the last page is denoted by the 
suffix L (e.g., 10L); associates a multipage message with a single- 
message ID. 

DATE CONTROL 
D—Deliver on MM (month), DD (day) date, current year 


P—Purge the message on the MM, DD date 
C—Deliver and purge the message on MM, DD date. 


Example: 


Store a message in the smr of one page under the key ID100, 
deliver this message to terminals 37ASR1, SANDI1; purge the 
message on July 11 of the current year. 
(SOH) SMSD. 37ASR1 SAND1/ID100/1L/P 7-11 (STX) TEXT 
(ET X) 

Fetch the above message: 


(SOH) SMFT. /ID100 (STX) (ETX) (The message will be 
delivered to the terminal initiating the fetch.) 
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The SMF is the most heavily used system data file. Other system 
files are: Authorized, Resend, Batch, Paging, and Error. They have 
syntax similar to the SMF. 


APPENDIX B 
Test Message Examples 

Communication lines in a BISCOM system normally terminate in 
more than one terminal; however, it is assumed for purposes of this 
discussion that each line terminates in only one terminal with the 
logical names given in Fig. 3. Consider the following test messages 
queued to driver-simulated terminals; the # character specifies a 
field used by the driver for message-sequence numbering. 


INPUT TO DRIVER TERMINAL LTI1: 
(SOH) LT1 (STX) *REP 100 *POL 350 *SEL 100 LT2 PT1 *STX 
Test A# ## (ETX). 


Sent to B1iscom From Simulated Terminal LT1: 


(SOH) LT2 PT1 (STX) Test A001 (ETX) 
350 ms between messages sent to BISCOM 
(SOH) LT? PT1 (STX) Test A100 (ETX). 


Delivered to Simulated Terminal LT2: 


(STX) Test A001 (ETX) 
100 ms between deliveries 
(STX) Test A100 (HTX) 


Delivered to Primary Real Terminal PT1: 
(STX) Test A001 (ETX) 
(STX) Test A100 (ETX). 


The following example shows how a test message may be used for data 
base operations: 
INPUT TO DRIVER TERMINAL LT1: 


(SOH) LT1 (STX) *REP 9 *SOH SMSD. RT1 LT#/BTL## 
*STX TEST ## (ETX): 
Sent to Brscom from Simulated Terminal LT1: 
(SOH) SMSD . RT1 LT1/BTLO1 (STX) TESTO1 (ETX) 


(SOH) SMSD . RT1 LT9/BTLO9 (STX) TESTO9 (ETX). 
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Stored in Primary System Special Message File (SMF): 
TESTO1 Stored under key BTLOL 


TESTO09 Stored under key BTLO9. 


Delivered to Primary Real Terminal RT1: 
(STX) TESTO1 (ETX) 


(STX) TESTO9 (ETX). 


Delivered to Simulated Terminals LT1-LT9: 
(STX) TESTO1 (ETX) to LT1 


(STX) TEST09 (ETX) to LT9. 
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Calculation of Multidisturber Crosstalk 
Probabilities—Application to 
Subscriber-Loop Gain 


By P. M. LAPSA 
(Manuscript received January 27, 1976) 


The possibility that excessive crosstalk limits the application of gain to 
long voice-frequency loops was studied. This problem was examined by 
means of a probability model incorporating data on such random variables 
as activity coincidence of disturbing and disturbed subscribers, disturber 
volume level, coupling path loss, listener acuity, and various noise sources. 

A new approach was used with regard to the probability distributions 
of random variables: the probability distributions were modelled in detatl 
for the calculation of the necessary functions, sums, and functions of sums 
of random variables. Results showed that gain of 6 dB or less ts acceptable, 
and 9 dB is unjustifiably excessive. The approach used provides the most 
accurate calculation possible with available data, and is antictpated to be 
convenient not only for similar crosstalk evaluations but also for other 
nongaussian probability problems. 


l. INTRODUCTION 


An important concern in speech transmission is the avoidance of 
crosstalk. When it is intelligible, it is a potential violation of the 
telephone subscriber’s privacy ; when not intelligible, it is nevertheless 
an annoyance, especially if syllabic in nature. In subscriber loop 
equipment, as a result of the permanent assignment of a pair to each 
customer, such crosstalk situations might tend to be dedicated to a 
specific disturbing talker and a specific disturbed listener, who might 
even be known to each other. Therefore, the random occurrences of 
intelligible crosstalk must be limited to a probability that is very 
small indeed. 

On long voice-frequency subscriber loops, it is often desirable to 
overcome excessive attenuation of the voice signal by applying some 
gain at the central office. The higher signal level, however, increases 
the probability that intelligible crosstalk will be heard. The existing 
gain limit of 6 dB was suspected to be already the maximum possible 
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without risking excessive crosstalk probabilities. Therefore, desired 
increases to as much as 9 dB were considered problematic, and this 
analysis was undertaken to evaluate it. 

Intelligible crosstalk results from a fortuitous combination of 
several random variables, such as the activity coincidence of disturbing 
and disturbed subscribers, disturber volume level, coupling path loss, 
listener acuity, and various noise sources. The ill-defined, stochastic 
nature of the problem has always necessitated various approximations. 
For the present analysis, an approach more exact than previously used 
was needed for two reasons. First, preliminary calculations with these 
conservative approximations yielded unfavorable results even for low 
values of gain. Second, the probabilities involved were extremely small. 
Therefore, if there was any way to justify more than 6-dB gain, 
the least conservative, most exact, calculation would be the most 
likely to do so. 

Most of the complications in crosstalk problems arise from the 
difficulty of measuring and analyzing the probability distributions 
of the random variables; these make the problem in its entirety seem 
formidable indeed. Fortunately, because of its importance, past work 
has shed much light on the construction of a reasonable analytical 
framework and on the estimation of the distributions. Aspects of this 
study, especially those described in Sections 2.1 and 2.2, continue in 
the line of analysis used most recently by D. H. Morgen,! T. C. 
Spang, B. E. Davis, and M. G. Mugglin of Bell Laboratories. 

The main new development in this paper is the calculation of 
crosstalk in terms of nongaussian probability distributions, where 
appropriate for improved accuracy. The data is therefore better 
represented than if approximating gaussian distributions had been 
fitted to it, a point especially significant when evaluating the extreme 
“tails” of distributions, as required in this case. The improvements 
arising from this approach include: 


(t) The exact distribution of measured coupling losses can be 
considered, rather than an estimation based on worst-case 
coupling or other approximations. 

(2) The threshold for intelligibility can be represented as de- 
pendent on an exactly calculated “power sum”’ of noise sources. 

(122) Distributions can be truncated when appropriate, as for 
example at the 30-dBrnC noise limit. This avoids taking 
advantage of one plant deficiency, excessive noise, to mask 
another, excessive crosstalk. 


The problem is addressed in terms of the general crosstalk proba- 
bility model summarized in Section II. Section III explains in more 
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Fig. 1—General crosstalk coupling model. 


detail the calculation of the total noise distribution from the distri- 
butions of the individual noise sources, after which the intelligibility 
threshold can be determined. The data for the distributions of random 
variables is discussed in Section IV, and Section V deals with some of 
the computational aspects involved. Finally, the results are discussed 
in Section VI. 


Il. GENERAL CROSSTALK PROBABILITY MODEL 
2.1 The possibility of crosstalk 


The essentials of the crosstalk problem can be described very 
simply and are depicted in Fig. 1. There is a disturber of volume V 
active on channel 7, a disturbed subscriber is listening on channel J, 
and a coupling path with loss L* connects them. The listening acuity 
of the disturbed subscriber is characterized by his intelligibility 
threshold J. 

Usually there are several possible coupling paths. Figure 2 shows 
the far-end and near-end crosstalk paths involved when voice-frequency 
gain is used on subscriber loops. However, it is easy to show that the 
far-end crosstalk probability is small (see Appendix A). The type of 
crosstalk path considered most likely to be disturbing with voice- 





CENTRAL OFFICE 


Fig. 2—vre cable crosstalk coupling path. 
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frequency gain was near-end crosstalk (NExT) at the central office 
(co) because the gain appears twice in the crosstalk path. Thus, LY 
in Fig, 2 is seen to include the coupling loss X* in the cable, reduced by 
twice the gain. When all quantities are expressed in dB, 


Lis = XG — 9G. 


The disturbed listener is considered to be at zero loss from the co, 
so that crosstalk is most likely to be heard. 

Crosstalk by way of the coupling path ZL*/ from disturbing pair 
t ¥ 7 to listener / is possible when both are active and subscriber J is 
using pair 7. The probability of coincident activity of two independent 
channels has been usually given as 


a A ; 
pr-1-[— J, (1) 





where 


# = mean holding time of call, 
\ = mean quict interval between calls, 
A denotes coincident activity. 


This was a result of modeling holding times and quict intervals as 
exponentially distributed; a derivation under more basic assumptions 
is presented in Appendix B. From (1), the probability of coincident 
activity for rural offices has been evaluated, assuming traffic to be 
somewhat heavier than average but uniformly distributed over all 
channels, 

PZ = Pa = 0.167. 


With P’? denoting the probability that the disturbed subscriber / 
is using channel j, the probability that crosstalk can reach him by way 
of path ZL, that is, the probability of coincident activity over the 
path, becomes 

Po =F EP, tA]. 

In this case, the subscribers on all cable pairs 7 were considered 

equally likely to be called by subscriber 1, which leads to 


1 


yes 

Pua 

pie pay (2) 
ie N 


It may be noted that both the talker volume and the noise distri- 
butions have been implicitly assumed to be identical and independent 
for all disturbers and paths, for which reason neither V nor N has 
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been carrying the superscripts 7 and 7. There seems little reason to 
assume otherwise, although, if desired, V and N could be treated in a 
multidisturber context similar to that of L. 


2.2 The intelligibility of crosstalk 


The condition under which crosstalk over the path L*/ becomes 
intelligible to the disturbed subscriber is, in dB, 


VY => dl; (3) 


This relation can be elaborated to include the dependence of the 
intelligibility threshold J on the masking effect of all noise sources : 


(z) Circuit noise Ne 
(iz) Room noise Nez 
(viz) A fit parameter Ny (which can be somewhat loosely inter- 
preted as residual noise in the auditory system, defining the 
hearing threshold). 


The total noise level is the sum of individual noise sources when 
expressed in units of power, rather than dB: 


1ON/10 = JON1/10 4 JONs0 4... 4 1QVKM0 = y 1QNe/0 (4a) 
k 


or, in short, 


Ne NG Naty oe We (4b) 


where the operator +, in (4b) is defined by (4a). In general, the noise 
sources must be specified not deterministically but rather by their 
probability distributions. The resulting distribution of the total noise 
level N can be calculated as described in Section III; we may assume 
for now that N is known. Then the intelligibility threshold I can be 
modeled as the following linear function of N (see Section 4.3), 


Thus, the condition for crosstalk to be intelligible, eq. (3), becomes 
V—-L¥>Iin+N. 


Now define* 
RAV —Li—I,—N. (6) 


Equation (8) is now 
Ri > 0. (7) 


* The case of gaussian random variables has been treated previously in detail, as 
for example in Ref. 1. 
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2.3 The probability of crosstalk 

With that background, the probability of crosstalk over the path 
L*i can be calculated as the product of the probability of coincident 
activity over that path times the probability of intelligible crosstalk 
when active, 


P¢@ = P4-Péia, (8) 
where 
éja = Pr {R% > 0}. (9) 


This can be evaluated by expressing the probability density function 
of R*#, denoted by fz, in terms of the p.d.f.’s of V, L*4, Io, and N, denoted 
by fv, fz, fr,, and fy, respectively. By virtue of eq. (6), f# is the con- 
volution 

R= fv *f21 * f-1, * fw. (10) 


Restating eqs. (8) and (9) in terms of f7, 
Py = PY f° sucRyae. (11) 
0 


There are (N — 1) possible disturbers when the listener is on pair 7. 
The probability that disturbers on pairs 7 ¥ 7 exceed the listener’s 
intelligibility threshold is, therefore, 
PL=1-JJ a -— P%). 
ij 
The rare occurrences of babble due to two or more sufficiently loud 
disturbers are included. When P¥ are small, this reduces to 
PL= D> Po. 

Aj 
Considering all N possible pairs used by J, the total probability of 
intelligible crosstalk to subscriber / is, again for small probabilities, 


N eA 
2. 2e Fe 


Hl i455 


Pe 


Il 


N ey 
ye Py f secryar 


Jeli¥s 
N Py ie) , 
= at W(R ; 12 


As a result of the linearity of the convolution and integration 
operations, this double sum of integrals can be reduced to a very 
simple form. Returning to (6), we can define an intermediate random 
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variable 


SAV-Ih-N, fs = fv *f-1, *f-n. 
Thus, 
RUAS — LY, B= fg* flr. 
Now (12) can be written as 
= W-0Pa [fee yapoy &, Es ttear. 


The function 


i 47 
wD & Soe) 
is simply the result of averaging the N(N — 1) individual p.d.f.’s f%, 
over all possible channel combinations 7 # j. Denote this averaged 
p.d.f. by f_z: 


he tangy 
ft ay SD) Fe 
Then 
NS Pi [ifs + f_1(R)dR. 
For 


RAV-In-N-L, fe =fv *f-1, *f_w *f-1, (13) 
Pe = (N — Pa , ”fe(R)dR = (N — 1)Pall — Fe2(0)J, (14) 


where Fz is the probability distribution function of R. 

The multidisturber case is thus greatly simplified to a result parallel 
to the case of a single disturbing path, eq. (11), but with fr appropri- 
ately redefined by means of the average of the coupling loss p.d.f.’s. 
If all f# are identical, then of course, 

fu = fi 

and eq. (14) differs from eq. (11) only by the factor (NV — 1). For 
most cables, however, the f may not be identical for all pair combina- 
tions (2,7), because there may be a strong dependence on distance, 
shielding due to intervening pairs, and twist-length ratios. Noting 
that it would be a formidable task to specify the individual f?, we 
proceed to show how fx(Z) and the result (14) are easily available 
from a single overall cumulative distribution function. 

Since no entirely satisfactory analytic derivation of the f? has yet 
been devised, the most fruitful approach is to use direct measurements 
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of the coupling losses in cables (more specifically identified as cable 
units, because within-unit crosstalk predominates). The measurements 
form a symmetric, off-diagonal matrix for each cable (or cable unit if 
within-unit coupling is of primary interest), as shown in Fig. 3. By 
merging these results over all cable units measured, a cumulative 
distribution function F¥ in each (2,7) cell can be constructed, from 
which 


Z(L) = = [Pe (Z) ] 
and 


fu(L) = ; [F.(L)], 


where fF’, is the overall cumulative distribution function of all crosstalk 
measurements, 
FLL soe FY(L 
ae ye WOE 5%, > #( ). 

Therefore, even though the distributions differ from cell to cell, 
it is not necessary to maintain the identities (2, 7) of the measurements, 
a point which is most clearly evident when the distribution functions 
are expressed in terms of step-functions u(-) at the measured values. 
For K cable units measured, 


se 
FUL) = = Y w(L — Li) 


1 K ‘ 
WDE Di X u(L — Li) 


j=lixjk= 


Fy (L) = 
1 


where M = N(N — 1)K and the single summation extends over all 
measurements of all pair combinations. 

Section 4.2 points out that /, may have arisen from two distinct 
populations, adjacent within-unit (wu) pairs and nonadjacent wu 
pairs. In that case, proceeding as above, we can model F’; to show this 
explicitly : 


Fi(L) = & Fall) +o F,,.(0), (15) 


where A is the number of adjacent pair measurements. 

To summarize, the result (14) is parallel to the case of a single 
disturbing path in eq. (11), but with fe appropriately redefined. Of the 
four functions constituting fz, as shown in eq. (13), all but the noise 
power density fy can be derived directly from measured data. The 
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Fig. 3—Symmetric matrix of crosstalk coupling-loss measurements. 


calculation of fy is more complicated and is discussed in the next 
section. 


Ill. NOISE POWER-SUM CALCULATION 

It is necessary to evaluate the probability density of total noise 
power from stochastically independent noise sources. As summarized 
in eq. (4), the noise variables N;, in dB, must be transformed to units 


of power, e.g., watts, 
Wi = p(N;x) = 10%«/0 (16a) 


so that they can be added. 

Appendix C describes sums, functions, and general operator sums of 
independent random variables. To utilize these for the transforming 
function p(-), we only need to calculate its inverse and derivatives, 


p (Wi) = 10 logio Wi (16b) 
= p(Ni) = a 1Q¥#/10 (16c) 

d | _ 10 1 
aw, ? ‘Wr) = 765 W,. (16d) 


The term NV, and its p.d.f. f, thus transform as follows: 
10 1 
Nu, fu (Ni) = Wi, 9 (We) = in 10 Ww, 7#(10 logio Wi). — (17) 
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Then the total noise power, in units of power, is 


W=> W, 
P 


and its p.d.f. g(W) is the convolution of the g;, 
9 =gi* +++ * 9K. 
To invert W back to dB, we use Appendix C with u = p— to obtain 
N = 10 logio W = 10 logio X W, 


= 10 login © 10¥#/0 (18) 
k 


and 
In 10 1 


foc) = {BP roves) g(10%n9 
_ {In 10 
7 10 

This, with the aid of eq. (17), defines the p.d.f. of total noise power. 
The power sum convolution process can be summarized by applying 
it explicitly to noise sources mentioned in Section 2.2. As in Appendix 
C, the notation can be greatly simplified by introducing the symbol «, 


for this “power convolution,” analogous to the “power sum” symbol 
+, in eq. (4b). Then eqs. (18) and (19) reduce to merely 


N = Neo+pNrt+pNru (20) 
iv = fe *ofr *p fru. (21) 


The next step is to describe some data for the random variables 
discussed : V, L, Lo, Ne, Nr, and Nou. 


10%} gi* ec: * gx (10%), (19) 


IV. ESTIMATION OF THE PROBABILITY DISTRIBUTIONS 


As mentioned in the preceding sections, the main random variables 
are talker volume, intelligibility threshold, coupling loss, and noise. 
Data for them was taken from the best surveys available at this time. 


4.1 Talker volume 


The 1960 survey of talker volume at central offices as reported by 
McAdoo? was used to model talker volume. Loudness of intraoffice 
calls varied substantially with the size of the co and local calling area, 
the means ranging from —19 vu (volume units) in New York City 
to —29 vu in Enid, Oklahoma. The figure actually used was —26 
vu, as measured in Pascagoula, Mississippi, which has a population 
of 17,000 with 8000 station sets. This volume level was somewhat 
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below the survey’s nationwide average of —24.8 vu, in recognition 
of the use of vFa (voice-frequency gain) primarily in the smaller 
communities. 

The distribution about the —26 vu mean was modeled as normal 
with a standard deviation of 7.3 dB, as reported by the survey. 


4.2 Coupling loss 


Because crosstalk from within-unit (wu) pairs predominates over- 
whelmingly, the most accurate approach was to consider the distri- 
bution of wu coupling loss explicitly, rather than as a small tail of a 
merged distribution for all pairs in a cable. NExT coupling loss data 
at voice frequency was available from Bell Laboratories measurements 
of an electrically long, 22-gauge, 100-pair, even-count pic cable, 
comprising eight units of 12 and 13 pairs. Figure 4 shows the smoothed 
cumulative distribution function of 468 measurements in wv coupling- 
loss matrices. 
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Fig. 4—Distribution of 1000-Hz Next coupling loss (dB). 
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Coupling loss is usually assumed to be log-normally distributed. 
If this were true of the above data, then it would appear as a straight 
line in Fig. 4. A possible explanation of why this is realized only 
approximately is that loss is not distributed uniformly for all pairs, 
but instead differs for two (or more) distinct populations: adjacent 
Wu pairs and nonadjacent wu pairs. When this hypothesis was tested 
with some available data of coupling-loss measurements by South 
Central Bell, a substantial and consistent difference between adjacent- 
pair and nonadjacent-pair measurements was indeed found. 

Therefore, the distribution function of Fig. 4 might be interpreted 
as a merging of a distribution for adjacent pairs and a distribution 
for nonadjacent pairs, as in eq. (15). The total distribution function 
was divided in proportion to the number of adjacent and nonadjacent 
measurements so that a truncated normal distribution function could 
be fitted to each part. The approximation gave results that agreed 
reasonably well with those calculated from the distribution of Fig. 4 
exactly (see also Section V) ; in particular, the truncated normal fit to 
the low-loss tail, attributed to adjacent pairs, accounted for most of the 
crosstalk probability. 


4.3 Intelligibility threshold 


It was mentioned in Section 2.2 that the dependence of intelligibility 
threshold on total noise level is linear, as developed in Refs. 1 and 3. 
In this representation, eq. (5), Jo was modeled as normally distributed 
in dB with a mean of —95 vu and a standard deviation of 2.5 dB, 
based on the subjective tests by Sen.’ These values were determined 
in experiments in which coupling loss was flat with frequency. The 
threshold has been found to be slightly lower if coupling loss decreases 
with frequency ; in case of a 6-dB-per-octave slope, sensitivity would be 
2 dB greater for a male voice, or 1 dB greater for a female voice. 
However, in view of other conservative assumptions, this possible 
effect was neglected. 


4.4 Noise sources 


Section 3 described the total noise level as a power sum of circuit 
noise, room noise, and hearing-threshold noise. Their distributions 
were characterized as described below. 

The circuit noise distribution was taken from data of the 1964 
Loop Survey described by Gresh.! The data for long loops was used 
because this would be most appropriate to vre. The distribution is 
very nearly normal in dB, with a mean of 17.5 dBrnC and a standard 
deviation of 14.85 dB. However, two adjustments had to be made. 
First, it is clear from Fig. 2 that the gain would amplify the noise, 
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so that the distribution had to be shifted by this amount. Second, a 
considerable fraction of the loops exceeded the immediate remedial 
action limit of 30 dBrnC. Although the excessive noise would very 
effectively mask crosstalk, it was deemed better to assume that either 
this plant deficiency has been already cleaned up (for example, by the 
introduction of ringer isolators), or else it will be done in the near 
future. Therefore, the distribution used was normal in dB (hence, 
log-normal in power, as described in Section III and Appendix D), 
truncated at 30 dBrnC, with a mean of (17.5 + gain) dBrnC, and a 
standard deviation of 14.85 dB. | 

For the noise fit parameter Ny, the value was 12.3 dBrnC;* for 
room noise, a normal distribution with a mean of 45 dBt (dB relative 
to 20 »Pa) and a standard deviation of 7 dB was used.! The conversion 
from dBt to equivalent dBrnC at the set terminals has been 


45 dBt > 11.5 dBrnC. 


Some tests indicate this to be a conservative estimate for mean room 
noise. However, it is not known whether its inherently different 
character may make it less masking than circuit noise, so that the 
conservatively low value of 11.5 dBrnC may be quite appropriate 
until further experiments are conducted. 


V. COMPUTATIONAL ASPECTS 


Conceptually, three main steps are involved. First, the ‘power 
convolution” of eq. (21) computes the noise power p.d.f. Second, this 
result is used in the (ordinary) convolution of eq. (13) to compute the 
p.d.f. of the intelligibility random variable. Finally, the crosstalk 
probability is easily obtained as the probability that this variable 
exceeds zero, eq. (14). 

Except for very simple p.d.f.’s, such as the normal distributions 
of V and Io, these calculations are very difficult to calculate by hand, 
yet very easy by computer. Consequently, a computer program 
implementing the calculations of Appendix C was used. It is thereby 
possible to calculate the p.d.f. of algebraic expressions of independent 
random variables and their differentiable, invertible transforms; the 
“power” transform is seen to be a simple special case in Section III. 
The convolutions involved were performed very conveniently by 
using a fast Fourier transform routine,® a method which is valid to the 
extent that the p.d.f.’s are well represented by the calculated discrete 
Fourier transforms. 


* No variance was modeled for this parameter; however, as can be seen from eq. 
(5), the parameter J does introduce variance due to subjective perceptual differences 
into the basic equation for the intelligibility threshold. 
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An advantage of the numerical, rather than analytic, approach 
is that the p.d.f.’s and transforms could be specified by any method 
expressible as a function, including the following: 


(t) Analytic form 
(iz) Table with analytic interpolation 
(277) Distribution of measurement values. 


Thus, it is possible to carry out an accurate calculation even when the 
data is distributed arbitrarily in a way that does not suggest a precise 
analytic representation. It should be noted also that well-behaved 
analytic functions can exhibit near-pathological behavior when trans- 
formed, as for example in Appendix D, in which case, particular 
attention to overall numerical accuracy is advisable. 

In short, crosstalk probabilities were easily calculated with the data 
of Section IV used in eqs. (21), (13), and (14), as a special case of 
Appendix C. 


VI. DISCUSSION OF RESULTS 


The crosstalk probability calculation involving the three main 
steps described in Section V was carried out for each value of vre of 
interest : 0, 4, 6, and 9 dB. Figures 5 through 8 illustrate the components 
and results of convolutions in the 4-dB and 9-bB cases. Figures 5 and 6 
will serve as an example for the discussion. 

The circuit noise p.d.f. with peak at 17.5 dBrnC + 4 dB, the room 
noise p.d.f. with peak at 11.5 dBrnC, and the convolved total noise 
p.d.f. including the hearing threshold of 12.3 dBrnC are shown in 
Vig. 5 for the range 0 to 30 dBrnC. It can be seen that this result is 
substantially truncated not only near 30 dBrnC, due to truncating 
the circuit noise there, but also near 12.3 dBrnC, due to the hearing 
threshold. The latter “truncation,” an evident result of the noise 
power sum, is significant in that it eliminates the possibility of extreme 
low-noise conditions that would otherwise contribute substantially 
to the crosstalk probability, or perhaps even dominate it. 

Figure 6 shows the convolution of the density of total noise N with 
the volume above intelligibility V — I, and with crosstalk coupling 
loss L. The resulting density is that of the intelligibility random 
variable R, where R > 0 signifies the event that intelligible crosstalk 
is heard. The probability of this event, as in eq. (14), is shown in 
Table I for various cases. Results for the three coupling paths identified 
in Fig. 2 are listed. The Next i — 1 case is the one analyzed so far; 
the probabilities of the FExT paths are comparatively small, as men- 
tioned in Section 2.1 and analyzed in Appendix A. 
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Fig. 5—Power sum of noise sources (gain = 4 dB). 


These results show how rapidly the probability rises above the 
insignificant trace at 0 dB once gain is increased, a feature clearly 
depicted in Fig. 9. Although a firm objective for crosstalk probability 
in loop plant has not been established, this should be certainly less 
than the 1-percent objective for trunk calls because the condition would 
be dedicated to a particular customer. A probability of 0.1 percent 
has been used as a criterion.! Here we note that this is reached near 
4 dB, and is greatly exceeded at 9 dB. 


Table |— Probability of intelligible crosstalk 


VEG NEXT 7 > | FEXT m > | FEXTi—~-n 
(dB) (%) (%) (%) 

0 <0.01 <0.01 <0.01 

4 0.14 0.02 0.03 

6 0.46 0.04 0.05 

9 2.09 0.09 0.18 
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Fig. 7—Power sum of noise sources (gain = 9 dB). 


The importance of extreme accuracy in calculation is now apparent. 
A calculation of the R distribution, which is “only” 99.9-percent 
accurate, could possibly rate the 4-dB case at twice its actual crosstalk 
probability. Figure 6 shows graphically how small the density is above 
R = 0. In the computer calculation, the precision can be increased 
as necessary by increasing the order of the fast Fourier transform, 
so the real limitation is the accuracy of the input data. In that, there 
is indeed room for further improvement, especially regarding the 
coupling-loss distribution, which is considered representative according 
to other measurements but was obtained from a single cable. The true 
masking effect of room noise should also be investigated further. For- 
tunately, more complete data are anticipated in the future, and it will 
be a trivial matter to recalculate the probabilities with this method. 
However, it is unlikely that the larger probabilities, such as for 9 dB, 
will turn out sufficiently smaller to be near 0.1 percent. 

Four points must be emphasized. First, the crosstalk calculations 
were made with respect to the connections in Fig. 2—reasonable but 
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Fig. 9—Total crosstalk probability as a function of gain. 


still arbitrary conditions; different conditions would yield different 
results. Second, the 0.1-percent criterion is intended to be applicable 
to plant that is fully dedicated to a particular customer. In Fig. 2, not 
all of customer l’s intraoffice calls would be to a community of interest 
served by vra. Therefore, provided that the proportion of cables at a 
co served by 6-dB gain is less than 20 percent of the cables without 
gain, the crosstalk probability experienced by the listener will be less 
than 0.1 percent. Since only 10 percent of the Bell System “long 
loops” (longer than 30 kft) exceed 60 kft, that degree of saturation is 
not considered likely; in this case, 6-dB gain is satisfactory. Third, 
the high noise level of long loops in the 1964 survey (truncated at 30 
dBrnC) was used in the calculations. It is likely that extreme noise 
cases have been and will continue to be mitigated, with the unfortunate 
side effect of ultimately increasing crosstalk intelligibility. Finally, 
even under the high circuit-noise level used, the crosstalk probability 
for 9-dB gain considerably exceeds even the trunk-call objective. 


Vil. CONCLUSION 


The subscriber loop vFe limitation due to probability of crosstalk was 
evaluated from available data on the random variables involved, 
including activity coincidence of disturber volume level, coupling- 
path loss, listener acuity, and various noise sources. Probability 
distributions which were nongaussian were represented in detail, and 
the necessary functions, sums, and functions of sums of random 
variables were calculated accordingly. The results, based on the best 
data available today, show that gain of 6 dB or less is acceptable and 
9 dB is considerably excessive; the desirability of further, more exten- 
sive, data is thereby indicated. The method of calculation is inde- 
pendent of specific assumed forms of the distributions of random 
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variables and, hence, anticipated to be convenient not only for similar 
crosstalk evaluations requiring high accuracy, but also for other non- 
gaussian probability problems. 
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APPENDIX A 
FEXT Calculation 


Consider the FExT coupling path shown in Fig. 2, from disturber 7 
to listener n. The calculation is perhaps somewhat subtler than that 
for NExT in the main study, because it is necessary to find the distri- 
bution of intelligible crosstalk probabilities that might be assigned 
to the listener, rather than a single value independent of assignment. 
For a fixed L*, eq. (11) may be written in terms of S = V — Ip — N, 


= Pe ["f(R+ Lear 
0 


= Ps [ fs(8)as 
When subscriber n is assigned his pair, he is in effect assigned the 


coupling loss LZ, from some distribution FY, and a resulting P%. 
Denote by F@ this distribution of crosstalk probabilities: 


Fe(C#) = P(PE s C*) 


; Cin 
= in —I1 i 
- plus z rn (1-%)] 


- re, |-r (1-$)]. (23) 


Moreover, the disturbed subscriber n will be subject to N — 1 such 
coupling paths. If the individual C™ are small, then the total crosstalk 
probability is approximately 

C* = ee Cin (24a) 


iAxn 








with density 
fo S fo Wai tle Bid Flee, TER (24b) 
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Fig. 10—Distribution of 1000-Hz rext coupling loss (dB). 


and distribution 


rn = [~ px(Cyac. (240) 


Thus, the desired distribution of FExT probabilities assigned to the 
subscribers has been determined. 

Instead of this calculation, some insight is often gained by evaluating 
(22) with L* having a fixed value at the worst 1-percent of the cou- 
pling-loss distribution. Several sets of past measurements show that the 
1-percent worst coupling loss is well estimated to be 84 dB at 6 kft. 
The loss at other lengths, including gain at the central office, 1s® 


l 


)+al- 4, 


where / is the cable length, and a is the attenuation per unit length. 
The lowest-loss application of 6-dB gain has been for 2000 ohms of 
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22-gauge cable. This amounts to 11.5 miles, at which length the loss 
is 9.3 dB.* Thus 
L*3(60.8 kft) = 77.24. 


Using this value, P?” evaluates as a negligible 0.01 percent. Similar 
estimates for other gain values are shown below. 

We now return to the exact calculation, eqs. (22) through (24). 
Within-unit FExT coupling-loss data similar to the NExT data of 
Section 4.2 is shown in Fig. 10. However, now the individual distri- 
butions F¥ are required, rather than their merged cumulative distri- 
bution as before. Therefore, the calculation is possible only at the price 
of assuming that the L* are independent and identically distributed 
according to Fig. 10. The resulting distribution of rextT probabilities 
is shown in Fig. 11 for 6-dB gain. It can be seen that the probability 
of assigning a 0.1 percent or greater FExT probability to a subscriber 
is less 0.1 percent. 
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Table Il — FEXT probability 


VFG Distributed Loss 1% Worst Loss 
(dB) (%) (%) 

0 <0.01 <0.01 

4 0.03 <0.01 

6 0.05 0.01 

9 0.18 0.03 


Similar calculations were performed for the other values of gain. 
Because it is somewhat unwieldy to compare distributions, the 1-per- 
cent worst assignment resulting from the distributed coupling loss is 
specified in Table II. The estimation based on 1-percent worst coupling 
loss is also shown; both indicate negligible probabilities. 

The FExT coupling path from disturber m to listener J also yielded 
small crosstalk probabilities. These results are summarized in Sec- 
tion VI. 


APPENDIX B 
Probability of Coincident Activity of Two Channels 

Channels a and b will be assumed to have calls initiated and termi- 
nated independently, with the simple assumption that the probabilities 
of these events during small time increments are proportional to the 
length of these increments. The quiet intervals and the call holding 
times will be shown to be exponentially distributed, as in Ref. 7 for a 
Poisson counting process. 

To derive the distribution of quiet intervals, let 
Po(a1, 2) = Pr{No call is initiated in [a, 22) | 

21 is in a quiet interval}. 

Assume that, as in a Poisson process, the probability of initiation 
during a time interval Az is proportional to an initiation-rate parameter 
1/ except for higher-order terms 0(Az), 


1 — Po(a, « + Az) = < Av + 0(A2) Ve 2 0, Ax 2 0. 
The probability of no initiation in [0, x + Az) is then 
P,(0, x + Ax) = Py(0, x) Po(a, x + Ax) 

P,(0, 2) E a : hs 0(ac) | 


O(Az) | 
Ax 


P,(0, xv + Az) ae P,(0, x) 


1 
Ar = — x Po(0, z) — Po(®, 2) 
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In the limit as Ar — 0, 


dP, (0, x) ares 1. 
a = *, P o(0, Zz); 
which has the solution 


P,(0, x) = e7=/s r= 0. 


IV 


The probability of initiation in [z,xv-+ Az), with no initiation in 
[0, x), is then 


Pi(a, 2 + Ax) = P,(0, x)[1 — Pola, x + A) ] 
= Po(0, 2) | 5 ax + 0(42) | 


This is the probability that the quiet interval after time 0 has a 
length between x and x + Az. Denoting the probability distribution 
function of these quiet interval lengths by F, 


AF (x) _ Pi(a, x + Az) 
Ac Az 


0(Az) 
Ax 





1 
— P(0, x) X = P,(0, x). 
The probability density function of these quiet interval lengths is the 
limit as Ar — 0, 


AF (x) _ ¥ Po, 2) = . enzlh x= 0. (25) 


Ax 





f(z) = lim 
Az—0 


Similarly, given that a channel is active at time 0, the probability 
density function of a call termination at time y is determined from the 
termination-rate parameter 1/y, 


1 

= eo ul, =0 
Suy = 44 . 
0, y <0 


(26) 


The mean quiet interval is 


[ afe(x)dx = in 7 ody =». 
a o A 


Similarly, the mean active interval is p. 
The probability to be derived is that of activity on channel b during 
some portion of a single call on channel a. This is most easily done 
by first calculating the probability of avoiding such coincidence. 
First, the probability that channel a initiates the call during one of 
b’s quiet intervals is 
d 


+d 20 
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Next, let y be the time interval from this initiation to the end of a’s 
call, and x be the time interval from this initiation to the end of b’s 
quiet interval. Then activity coincidence is avoided if 


ea: (28) 
Define 
2=%—Y, 
so that (28) is the event 
20: (29) 


The p.d.f. of z is 
fale) = fe fue) = [ folo)fale — 2)dev. 
From (25) and (26), 


f.(z) = : Xeon se e(2-2) lady, for 220 
z L 


at Me yet 1: e-2LA)+ Mw ge 
Au . 

= 1 gen, (30) 
A+ p 


From (27) and (29), the probability of avoiding coincident activity is 


1 — PY = Feaa [roe 


: 24 
— ab Sa —z/d 
‘ E ica: i 


2 
|: 


P® A : 
beak 

It should be noted that this is a conditional probability describing 
the event of coincidence given that channel a is active. In other words, 
this is the probability that b is active during any portion or portions 
of precisely one call by a. It is necessary to be aware of just how 
‘probability of coincident activity” is defined to avoid ambiguous or 
misleading results. 





~ +|> 





+ 


Thus, 


APPENDIX C 
Functions and Operator Sums of Independent Random Variables 


Let X be a random variable with p.d.f. f(X). Let u(-) be an in- 
vertible function such that wu and its inverse are differentiable. Then 
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the p.d.f. of 
Y = u(X) 


denoted by g(Y) is easily found as follows: 
Pr {y S Y} = Pr {u(@) S Y} = Pr {@ S u"(Y)} 


‘a f(X)dX 


(1) =a [remax =[ Bem] mem) en 


Now consider the problem of finding the p.d.f. of a random variable 
that is the inverse function of the sum of a function of several random 
variables, 


X= wfu(Xy) +--+ + ulX)]=wlS uXI], (82) 


with the p.d.f. of X; denoted by f;. The first step is to find the p.d.f. of 


Y; = u(X,a), (33) 
which is 
d 
a(¥) =| Zr | 91 (34) 
Then the p.d.f. of 
Y= 2» Y; (35) 
is the convolution 
g(Y) = gr * +++ *gn(Y), (36) 


and the p.d.f. of X = u—(Y) is, by applying eq. (31) with u(-) re- 
placed by u-!(-), 


d 
7%) = | fp we) | ou. (37) 
The calculations (32) through (87) are much more conveniently 


expressed by defining an operator +, analogous to +, to deal with 
the function u(-): 


(Xa) +u (Xo) A u[u(Xa) oe u(X») J. 


Similarly, the extended convolution *, is defined by eqs. (34), (36), 
and (87). Then all of (32) through (87) can be expressed as 


X= Xituc:: +u Xn 
F(X) = fru: *ufn 
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Section III illustrates the convenience of this notation in the case of 
power sums. Another important application is the multiplication of 
random variables, 


X = Xq-Xy = eleXatloex. = KX, +, Xp, 
where the function m(-) is defined as 
m(X) = log X, 
which is invertible and differentiable as required. Thus, 


F(X) = fa *mfe. 


Since both addition and multiplication are covered, it is possible 
to calculate in this manner the p.d.f. of algebraic expressions of 
independent random variables and their differentiable, invertible 
transforms. 


APPENDIX D 


Some Pathological Aspects of Power-Summing Random Variables with 
Large Variances 


When the variances of gaussian random variables in a power sum 
are small compared to the means, it is easy to see that the power 
function will not substantially distort their distributions. With p(-) 
defined as in Section III, 


Y = p(X) = 10*9 =X = g(Y) = p(Y) = 10 logwY. 


The term Y can be normalized with respect to its mean » and standard 
deviation o if the variance is finite: 


b Sea 
Co 





Ya 


X = G(Y) = 10 logio (u + cY) = q(Y). 


If N (X) is the normal c.d.f. of X, and G(Y) the c.d.f. of Y, by Appendix 
C they are related as follows, 


G(Y) = NLa(¥)]. 
Expanding around Y = ° 





q(Y) = q(0) +4 7 (OY with error < aA “4 yy? 
10 ¥o\ x, 
qg(Y) = 10 login + —— in 10. Tk Y with error < —~ ini0 (: ) ¥?: 


Thus, for |o/u| small q is approximately linear, and the c.d.f. of Y 
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may be expected to approach 


GY) = N (10 eee ae 
In 10u 
which is normal also. 
An entirely different situation exists when the variances are large 
compared to the means. As an example, consider the distribution of 
circuit noise N¢ from Section IV, which is normal in dB with p.df. 





Fe(Nc) = a Sore Soy, “a SATE. g SAS: 


Applying p(-) to find the density in units of power, 


10 
ge(We) = in i0 OW fc(10 logio We) 


10 1 1 


a5 ind qoe We exp[ — { (10 logins We — p)?/207} ]. 


To show that this density is extremely skewed, we find its maximum 
by differentiating, 


dgc(We) _ 10 1 1 (- a 7 
~dWe «In 10 Vaae E 5) (10 logio We — 2) 


(et wi) 7 wW 7 | exp [— { (10 logio We — u)*/207} J. 





This is zero at 
Wemax Ze 1 Q la-2? (in 10/10)]/10 — JQ 27.5-14.857 (In 10/10)}/10 — 0.0005. 


Now the median value of gc is at 
Wemea = p(w) = 1049 = 1017-510 = 87.5, 


The large difference between the mode and the median indicates the 
function’s skewness. 

A graph of g¢ up to the truncation point 80 dBrnC would show very 
little, but the salient features are depicted in Fig. 12, which has the 
scale broken and expanded at We = 10 and 0.003 so that some detail 
is visible. It can be seen that gc is less than 10 percent of the maximum 
over almost all of the range 0 to 1000.0, but rises sharply for values 
near zero. 

Because of the difference between the mode and the median of gc, 
it can be concluded that neither a normal density at the median nor 
an impulse at the maximum would be a very good approximation. 
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The former would underestimate the density at the low-noise tail, 
whereas the latter would overestimate it. 


REFERENCES 


1. D. H. Morgen, ‘‘Expected Crosstalk Performance of Analog Multichannel Sub- 
scriber Carrier Systems,’’ IEEE Trans. Commun., COM-23, No. 2 (February 
1975), pp. 240-245. 

. K. L. McAdoo, “‘Speech Volumes on Bell System Message Circuits—1960 Survey,” 
B.S.T.J., 42, No. 5 (September 1963), pp. 1999-2012. 

. T. L. Sen, ‘Masking of Crosstalk by Speech and Noise,’ B.S.TWJ., 49, No. 4 
(April 1970), pp. 561-584. 

. P. A. Gresh, ‘Physical and Transmission Characteristics of Customer Loop 
Plant,” B.S.T.J., 48, No. 10 (December 1969), pp. 3337-3385. 

. J. T. Peoples, personal communication, 1975. 

. Members of the Technical Staff, Bell Laboratories, Transmission Systems for 
Communications, Revised Fourth Edition, February 1970. 

; wor Davenport, Jr., Probability and Random Processes, New York: McGraw- 

ill, 1970. 


N Oo BP Ww NW 


CROSSTALK PROBABILITIES 903 


Copyright © 1976 American Telephone and Telegraph Company 
Tue Beutu System TECHNICAL JOURNAL 
Vol. 55, No. 7, September 1976 
Printed in U.S.A. 


Offset and Tilt Loss in Optical Fiber Splices 


By D. GLOGE 
(Manuscript received March 8, 1976) 


Transverse offset and angular misalignment (tilt) are sertous causes of 
loss in multimode fiber splices. Our computation of these losses in multi- 
mode graded-index fibers reveals that the loss depends strongly on the 
power distribution tn the fiber. We find that offsets of 0.1 of the core radius 
(or tilts of 0.1 of the fiber numerical aperture) cause a loss of 0.1 dB in 
the case of steady-state conditions, and between 0.384 and 0.88 dB tf the 
power distribution ts assumed as untform. We compare these results with 
measured offset loss values and conclude that the steady-state distribution 
better reflects transmission line conditions than the untform distribution. 


I. INTRODUCTION 


Fiber transmission lines will frequently have more than 10 splices 
between terminal points; to estimate the overall line loss, we need 
a reliable figure for the splice loss to be expected under varying line 
conditions. The loss in multimode fiber splices depends on the power 
distribution among the modes in the fiber. The difficulty of correctly 
simulating line conditions in laboratory measurements or computa- 
tions has so far prevented the establishment of a reliable splice-loss 
estimate. Calculations have predicted as much as 0.4 dB for a 10- 
percent relative offset (offset divided by the core radius). These calcu- 
lations assumed an equal or uniform power distribution in all modes.1 
Techniques used to measure the offset loss have attempted to simulate 
the equal-power condition by illuminating the full numerical aperture 
of the input fiber by a coherent source,!? but the resulting loss values 
were between 0.1 and 0.2 dB for a 10-percent offset. The question has 
been raised whether some of the discrepancies are attributable to 
leaky rays. 

The following study considers lateral offset and angular misalign- 
ment (tilt) of splices made from perfectly similar fibers by joining 
them without air gap. As an introduction to the subject, we derive 
simplified offset-loss formulae for power-law profiles, considering both 
the absence and presence of leaky rays. We find that leaky rays are 


905 


most likely not the cause of the discrepancies. The main part of this 
study concerns the misalignment loss in square-law fibers when the 
mode-power distribution is nonuniform as a result of mode-conversion 
effects. Special attention is given to the steady-state power distribu- 
tion.? We find that the discrepancies mentioned earlier can be ex- 
plained in terms of differences in the relevant mode-power distributions. 


Il. UNIFORM DISTRIBUTION 


To compute the offset loss, Miller! compares the number of rays 
that exit and enter at coinciding points of the two fiber end faces of 
an offset fiber joint (see Fig. 1a). The percentage of rays not accepted 
on entrance is the offset loss. As a first case, we assume all rays to be 
uniformly excited. Both fibers have the same dimensions and are of the 
same kind. In the case of step-index fibers, all rays traversing the 
overlapping core areas continue to propagate and the only loss occurs 
outside these areas. Offset graded-index fibers show a loss also in the 
overlapping areas where the local numerical aperture (LNA) of the out- 
going fiber is smaller than the corresponding LNA of the incoming 


OUTGOING 
FIBER) 


_INCOMING 
FIBER 





Fig. 1—Areas of transmission in a fiber splice. (a) End-on view of the splice. (b) 
Area of overlap rearranged for computation of offset loss. 
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fiber.* This condition exists for example in the right-hand half of the 
area of overlap in Fig. la. In this area, the outgoing fiber accepts only 
those rays that fall within its LNA; all others are lost. On the other 
hand—and this is important for our derivation—all rays in the out- 
going fiber are uniformly excited in this area since we have assumed 
that all rays of the incoming fiber were excited. 

In the left-hand half of the area of overlap, the incoming LNA is 
smaller than the outgoing LNA. Thus, all incoming rays are accepted 
there. On the other hand, the outgoing fiber is not fully excited. Since 
we assume that both fibers have the same characteristics, the number of 
incoming rays in the left half equals that of the accepted rays in the 
right half. If we arrange the two halves as in Fig. 1b, we find that the 
number of rays lost equals the number of those rays that traverse the 
blank area of Fig. 1b in a uniformly excited fiber. The width of this 
area is equal to the offset. 

Assume a fiber core of diameter 2a and define a radial coordinate r 
that is so normalized that r = 1 at the core periphery. Let n(r) be the 
core index and assume a constant index n(1) in the cladding. In the 
case of full and uniform excitation of all trapped modes, the rays 
traversing a point in the core are uniformly distributed over all angular 
directions within a cone that has the tna as its apex. The number of 
these rays is proportional to (LNA)? = n?(r) — n?(1). The integration 
of this quantity over the blank area of Fig. 1b, properly normalized, 
yields the offset loss. The integration can be performed analytically 
for step-index and parabolic profiles and leads to the results of Ref. 1. 

If the offset is a small fraction d of the fiber radius, we can approxi- 
mate the blank area by a rectangle of length 2a and width da. In addi- 
tion, we can ignore the variation of the LNA across the width of that 
area. These assumptions simplify the integration so that the offset 
loss becomes simply 


be fea e 
ee 


p= £28. (1) 
if [n?(r) — n?(1) Jrdr 
0 
If the index follows a power law 
n2(r) = ng(1 — 2Are) (2) 
in the core, the evaluation yields 
_ 2da+2 


* The una is proportional to the square root of the index difference at that point. 
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A comparison with the exact expressions for a = 1, 2, and shows 
that (8) represents the first term of an expansion of L; in powers of d. 
Remember that the result applies in the case of full and uniform excita- 
tion of all trapped rays, that is, those rays that are totally reflected 
inside the core. 

Leaky rays possess an evanescent cladding field in a region extending 
from the core periphery to a circle of radius A in the cladding.* The 
smaller A, the larger the radiative (leakage) loss 1(A). Stewart 
has found that the solid angle, which includes all rays having a loss 
smaller than /(A), equals the solid angle comprising the trapped rays 
divided by (A? — r*)?. To obtain an upper bound to the influence of 
the leaky rays on the offset loss, we assume A = a in the following 
although, by doing so, we include leaky rays that have a significant 
leakage loss. 

Arguments analogous to the previous derivation lead to an offset 
loss 

1 1 — r& 


_a@ ia © lees oy La 
PAS Fe 
rf as d=.) rdr 


for a uniform power distribution in both trapped and leaky modes. 
We obtain 


dr 
(4) 


ee Gok 
51 ae YD (5) 


K = 2¢%I (5) vi T'(a). (6) 


Table I lists loss values for some exponents a. Figure 2 shows loss 
versus offset for a = 2. In that case, an offset of one tenth of the core 
radius causes a loss of 8.5 percent or 0.38 dB if only the trapped 
modes are considered and 7.5 percent or 0.34 dB if all leaky modes 
are taken into account as well. The discrepancy is small. 

Equations (1) and (4) are based on the assumption that at least 
one of the fibers of the joint is fully excited at any given point in the 
splice cross section. This assumption requires some scrutiny in the 


Ly; = 


with 


Table |— Offset loss coefficients 
a 1 2 4 re) 
L,/d 0.95 0.85 0.76 0.64 
Li/d 0.85 0.75 0.67 0.5 
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Fig. 2—Offset loss vs relative offset for uniform power distribution according to 
eqs. (8) and (5). 


case of the leaky modes. In this case, the cone of rays traversing a 
given point in the core becomes elliptical, with the major axes oriented 
in radial and tangential direction. The offset juxtaposes cones with 
slightly different orientation. For (4) to hold, the larger of the cones 
must completely overlap the smaller cone. Without presenting details, 
we merely mention here that this condition is not satisfied, but that 
the resulting error is within the linear approximation in d. 


ltl. NONUNIFORM DISTRIBUTIONS 


Consider a ray that propagates in a fiber that has an index profile 
of the form (2). Assume that the ray forms an angle @ with the fiber 
axis at a distance ra from the axis. The principal ray optics variables 
r and @ have no physical meaning in the wave optics picture; the 


combination 
sin? 6 \2 





however, is directly related to the propagation constant of a fiber 
mode.* Given the power in each fiber mode, we can plot the power 
distribution p(R). Although the following computations can be per- 
formed for general power-law profiles, we restrict the derivation to 
the square-law profile in the interest of simplicity. The results for 
near square-law profiles are very similar. We introduce a normalized 


“FR? = (1 — 6/k’n§)/2A, where @ is the propagation constant and k the vacuum 
wave number. # is the magnitude of a vector coordinate in phase space pointing to 
the locus of the mode. We call R, therefore, the mode coordinate. 
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angular coordinate 
| p = sin 0/(2A)!, (8) 


so that, for square-law profiles, 
RE =r? + pt (9) 


The fact that r and p are interchangeable in this relationship implies 
that offsets and tilts can be treated in the same way.* More specifi- 
cally, in case of a tilt angle 6 and a lateral offset d, we can introduce a 
single offset variable D so that 


D? = @ + sin? 6/2A. (10) 
Before we calculate the offset loss, let us compute the total trans- 
mitted power P for a given power distribution p(R). We note from 
(9) that pdp = RdR and that p(R) = 0 for r > R. Hence, 
1 R 
P= i p(R)2nRdR I Qn rdr (11) 
0 0 
or 
1 
P = 29? / p(R) RR. (12) 
0 
If the power distribution depends on the angle ¢, as indicated in Fig. 3, 
we obtain 


P 


he i RMR / "oR, by sintwede (13) 
0 0 


by introducing rdr = ydz with x = rcos¢ and y = rsin ¢ in (11). 

Now consider a lateral offset d. Assume the power distribution in 
the incoming fiber to be q(R,) for R; < 1 and zero everywhere else. 
If we retain only linear terms of d as before, R; = R — d cos ¢, where 
R is the mode coordinate of the outgoing fiber. The power distribution 
in terms of this coordinate becomes 


2 arcos[(R—1) /d]J : 
p(k) = = / q(R — d cos ¢) sin’ ¢d¢. (14) 
0 


This expression is a modification of the second integral in (13), which 
takes into account that g = 0 for ¢ > arcos[(R — 1)/d]. A similar 
derivation holds for a tilt angle 6 or a combination of offset and tilt. 
The result has the form (14) with d replaced by D of (10). It is con- 
venient to normalize q(R) for unit total power P; this is done in the 
following analyses. 


*The modes of square-law fibers occupy a phase-space volume of rotational 
symmetry. 
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Fig. 3—Coordinate system used to compute offset loss. 


As a first example, assume a uniform input distribution 


2/1 for R;<1 





q(Ri) = 0 for R;> 1. (15) 
Because of (14), the distribution in the splice becomes 
a” R-1 2R-1 _ i= DPT 
pi(R) = = arcos D = pe "oa E oe | (16) 


Figure 4a shows q; and p; in the vicinity of R = 1 for D = 0.1. The 
offset loss is indicated by the cross-hatched area in Fig. 4a. It is 


1+D 
Le oe iA ni(R) RMR = = (17) 


in accordance with (8), which yields L; = 8d/3z for a = 2. 

Power transfer among the modes as a result of guide imperfections 
modifies the step distribution (15). If we assume uniform excitation 
at the input and model the power exchange by a diffusion process, 
we find that the step function (15) assumes the form 


1 — exp[(R; — 1)/0.17 (z/z.)*] (18) 


a short distance z from the input. In this expression, z, is the coupling 
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Fig. 4—Power distribution immediately ahead of the splice (g) and immediately 
behind the splice (p) for various power distributions plotted vs the mode coordinate 
FR. The cross-hatched areas indicate splice loss; D = 0.1. 


length.** For our purposes, the approximation 


1 for Ri; <1l-o 
_ 2(1 a 2c) (1 — R,)/o for 1—o<R;<1 (19) 
a 0 for Re St 


is satisfactory and valid as long as 
= 0.17 (z/z.)3 (20) 


is small compared to unity. Figures 4b and c show plots of qe for 
o = 0.05 and o = 0.1 as well as the distribution p2(R) calculated from 
(14). An integration of the cross-hatched areas similar to (17) yields 
the loss Le. Although an analytical solution exists, we list here only the 
simpler approximate form 


2D 3 8a \" 

In = 72 +20) ( pap + 2H) ey 
which is more useful for design consideration. Figure 5 shows LZ, and 
L, for ¢ = 0.05 and o = 0.1 plotted vs the offset D. If, for example, 
z- = 200 m, a power distribution resembling that of Fig. 4b (¢ = 0.05) 


*The coupling length can be obtained by measuring the impulse response at a 
fiber length where the steady state has been reached. The coupling length is then equal 
to that length, multiplied by the square of the rms pulse width measured, and 
divided by the square of the rms width expected without mode conversion. 
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would exist 17.3 m from the input; similarly, a distribution resembling 
that of Fig. 4c would exist 34.6 m from the input (provided the input 
excitation is uniform). A 10-percent offset (or tilt) at these points 
would cause a loss of 0.14 dB or 0.09 dB respectively compared to 
0.38 dB if the distribution were uniform at the splice. The reason for 
the drastic decrease in offset loss can be found in the fact that the 
modes close to cutoff suffer the main part of the offset loss, but cease 
to carry a significant portion of the total power a short distance away 
from the (uniformly excited) input. 

For o > D, the last term in (21) dominates. The loss can then be 
written in the form 


T 7 
L= "gps, (22) 


where q’(1) is the derivative of g with respect to R at R = 1. In this 
case, the loss depends on the square of the offset and L(D) has zero 
slope at D — 0. 

As 2 approaches z,, the power distribution in the fiber resembles 
more and more the steady-state distribution.? Properly normalized, 
this distribution assumes the form 


us J o(uR)/J1(u) for R <1, 


13 = 952(u2 — 4) 0 for esd. 


where J» and J; are Bessel functions and wu = 2.405 is the first root of 
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Fig. 5—Offset loss in dB vs relative offset or tilt for the four mode-power distribu- 
tions sketched in Fig. 4. 
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Jo. Figure 4d shows qs; as well as the distribution p; in the vicinity 
of R = 1. The fraction of p; that extends beyond R = 1 indicates the 
loss in the splice; it can be obtained from (22) by computing q;3(1) 
with the help of (23). The resulting loss Z; is also plotted in Fig. 5. 
The part of the power distribution p; that deviates from the steady- 
state qg3 in the region R < 1 is accepted by the outgoing fiber, but is 
dissipated later as a result of mode conversion until the steady state 
is reestablished. An integration of p3; — q3 analogous to (17) but ex- 
tending from R = 1 — D to R = 1 reveals that this loss contribution 
is equal to the loss suffered in the splice. Thus, the total loss can be 
obtained by inserting q3(1) into (22), as mentioned earlier, and by 
doubling the result of this integration; the total loss caused by an 
offset under steady-state conditions is, therefore, 


utD? 


16= 39a) 


(24) 
This loss is plotted in Fig. 6. A 10-percent offset or a tilt angle equal 
to one tenth of the fiber Na causes 0.1 dB loss compared to 0.38 dB 
if the power is uniformly distributed in all trapped modes or 0.34 dB 
if leaky modes are present as well (Fig. 2). 
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Fig. 6—Steady-state loss vs relative offset. Points show loss measured for offset 
between two 4-km fibers. 
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IV. COMPARISON WITH MEASUREMENTS 


Figure 6 also shows loss values measured in a splice connecting two 
3-km fibers. Details concerning the measurement are presented in 
Ref. 1. The coupling length characterizing these fibers was not mea- 
sured, but was probably of the order of the fiber length. The fact that 
the loss values are consistently lower than the predicted steady-state 
loss may indicate that the 4-km length was not quite sufficient for 
the steady-state power distribution to establish itself before and after 
the splice. 

Figure 7 shows loss values measured in a splice connecting two 3-m 
fiber pieces.2 The fibers had an outside diameter of 125 ym, a core 
diameter of 75 wm, and an Na of 0.155. They were part of a cable struc- 
ture that may have introduced some mode conversion. The fiber input 
was excited by a HeNe laser. Focused on the core, the gaussian laser 
beam filled a numerical aperture of 0.25 (at 1/e? intensity). The solid 
line in Fig. 7 is a repetition of Ly for c = 0.05 plotted in Fig. 5. Accord- 
ing to (20), this curve corresponds to the distribution present at a 
distance of 8.6 percent of the coupling length from the (uniformly 
excited) input. 

It is beyond the purpose of this paper to determine under what 
circumstances a coherent source produces a uniform power distribution 
at the fiber input. The results depicted in Fig. 7 suggest that the use 
of a laser beam for splice loss measurements is acceptable as long as the 
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Fig. 7—Loss measured for offset between two 3-m fibers; solid line applies to 
conditions of Fig. 4b. 
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fiber Na is adequately filled by the beam and a modest amount of mode 
conversion is present (or applied) between input and splice. Measure- 
ments under the conditions described yield loss values that are about 
50 percent too high at 10-percent offset and about equal to the steady 
state at 20-percent offset (or tilt). 


V. CONCLUSIONS 


We have attempted to show that the assumption of a uniform power 
distribution among all modes (or rays) is an idealization rarely ob- 
tained in fibers. The power distribution and particularly its behavior 
at the critical angle strongly affect the offset loss. If the offset loss 
calculation is based on the steady-state distribution, the loss at 10- 
percent relative offset (or tilt) is 3.8 times smaller than that predicted 
for a uniform distribution. Although the steady-state distribution does 
not necessarily exist at all splice points, we believe that it reflects 
transmission line conditions much better than a uniform distribution. 
We have compared these results with measurements by Miller! and 
Chinnock? and found that coherent excitation can provide offset loss 
values representative of line conditions if care is taken that the source 
fills the na of the fiber and a modest amount of mode conversion is 
effective ahead of the splice point. 
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Transmission vs Transverse Offset for 
Parabolic—Profile Fiber Splices With 
Unequal Core Diameters 
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A geometrical optics model 1s presented that 1s based on defining a 
local numerical aperture as a function of fiber radius with a uniform 
power distribution. Transmission vs transverse offset characteristics for 
parabolic-profile fiber splices are calculated for unequal fiber-core diame- 
ters. We show that the often-used assumptions of equal-mode excitation, 
equal-mode attenuation, and no-mode coupling are not adequate to calcu- 
late realistic transmission vs offset characteristics. Splice-loss measure- 
ments with long fiber lengths on each side of the splice show less than the 
calculated sensitivity to small offsets and greater than the calculated sensi- 
tivity to large offsets. 


]. INTRODUCTION 


Accurate transverse alignment is difficult to obtain in fiber optic 
splices primarily due to the small size of optical fibers. Many splicing 
techniques!.?:3 use grooves or channels and are dependent on fiber outer 
diameter (op) for transverse alignment. Experience thus far in- 
dicates that op variations as small as +1 percent can be achieved on 
a single long fiber; however, fiber-to-fiber variations for fibers drawn 
at different times may be considerably greater. Small fluctuations in 
core-diameter-to-op ratios are also expected. When the receiving 
fiber core is smaller than the transmitting fiber core, a loss occurs even 
with perfect transverse axial alignment. 

Thiel‘ and Henderson? have reported transmission vs offset for equal- 
core-diameter, step-profile fiber splices and Pugh® has considered core- 
diameter ratios less than one (receiving fiber smaller than transmitting 
fiber) for step-profile fiber splices. Calculated values for transmission 
vs offset for equal-core-diameter, parabolic-profile fiber splices have 
been obtained earlier by the author.’ 


917 


Transmission vs offset for parabolic-profile fiber splices with unequal 
core diameters is calculated in this paper. In addition, transmission vs 
offset-measurement data for equal-core-diameter, parabolic-profile 
fiber splices are presented. We found that the theoretical model gives 
pessimistic results for the equal-diameter case for the region of primary 
interest (offsets less than 0.8 core radius). Measured transmission is 
less than the model for offsets greater than 0.8 core radius. 


Il. ASSUMPTIONS 

The solid angle defined by the local numerical aperture (Na) of the 
fiber at every point on the core is assumed to contain a uniform power 
distribution. This assumption is consistent with equal-mode excitation, 
equal-mode attenuation, and no-mode coupling.* Consider the profile 


nO SH E = 2a (5) | for r<R, (1) 


where 
A = (no — 1¢)/70 1S small. 
refractive index at center of core. 


no = 

ne = refractive index of cladding. 

a = a parameter between 1 and ©. 
R = fiber core radius. 


The na as a function of radius is then 


na(r) = [a(r)? — nf J}, (2) 
. wa (0) & nov2 | 1 oa (3)]: (3) 


Since uniform power per unit solid angle is assumed transmitted for 
each incremental area of the core within the angle 


o(r) = sin™ [Na(r)/n(r) J, (4) 
the total power, P7, equals 


Pr & P(0) ia i. E = (3) rdrd®, (5) 


Pr = tR’aP(0)/(a + 2), (6) 


or 


where P(0) is a constant dependent on the input power A and 7p. 
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When the fiber cores are offset, the total power received at a given 
point is assumed to be limited by the transmitting or receiving Na, 
whichever is the minimum at that point. 


Ill. GENERAL APPROACH FOR THE PARABOLIC PROFILE (a = 2) 


For the case of equal fiber-core diameters, the locus of equal Na is a 
straight line, as shown in Fig. 1. The na function is integrated over the 
area of overlap bounded by a circle and a straight line. The resulting 
integral for received power in region I, where the transmitting fiber has 
the minimum NA, is 


cos! (d/2RT) Rr yr \2 
Pp = 2P(0) I / [1 zs (xz) rdrd8. (7) 
0 d/2cos 8 Rr 


For region II, where the receiving fiber Na function has the minimum 
NA, a positive translation of the na function must be made. The re- 
ceiving Na function, translated an amount d and referenced to the 
center of the transmitting fiber, equals 


i 
Four separate cases are considered to calculate transmission vs off- 
set for unequal core diameters. If K is the ratio of the radius of the 
receiving fiber to the radius of the transmitting fiber, K = Rr/Rr, 
and d = offset, then the four cases are as follows: 


y 


REGION If REIN I 
1 








EQUAL NA 


~ 
~~ RECEIVING 


a 
TRANSMITTING FIBER CORE 


FIBER CORE 


Fig. 1—Regions of overlap for offset fiber cores (Rr = Fp). 
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Case I 


See Fig. 2. d/Rr < K — 1for K > 1. Core boundaries do not inter- 
sect and Rr > Rr. 


Case II 


See Fig. 3. d/Rr > K — 1 for K > 1. Core boundaries intersect 
and Rr > Rr. 


Case III 


See Fig. 4. d/Rr < 1— K for K < 1. Core boundaries do not inter- 
sect and Rr > Rr. 


Case IV 


See Fig. 5. d/Rr > 1 — K for K < 1. Core boundaries intersect 
and Rr > Rr. 


For Cases I and III, areas of overlap are circles and the integrals 
are easily written and solved for the transmission. The resulting equa- 
tions are shown on Figs. 2 and 4. For Cases II and IV, some areas of 
overlap are bounded by arcs of circles, and integrals for these areas 
are tedious to solve. The approach used for these cases is to divide 


K=Rp/Ry 

a=d/(1— K) 
w = d/(1 + K) 
u = (a ~ w)/2 


q=w-u 





cL 
2 


q 
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Fig. 2—d < Rr — Rr. 
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Fig. 3—d > Rr — Rr. 


the areas of overlap into sections which are bounded by a circle and a 
straight line and to combine these sections to obtain the required 
geometry. All integrals then have the form suggested by eqs. (7) and 
(8) and can be segmented into the following three integral forms. 


cos! (a/b) b 2 
i J rdrd@ = Z cost $ — 5 ve — a’. (9) 
0 a 


/cos 8 2 


cos! (a/b) b 
; i r? cos 6 drd@ = 4 V(b? — a)’. (10) 
0 a 


/cos 6 


cos! (a/b) b 4 
/ / r drd@ = Z costs — 2 y (b? — a?)8 
0 a/cos 6 b b 12 
3 

— ra vb? — a. (11) 
These integrals were computer-programmed and summed to calcu- 
late transmission vs offset. Closed form expressions were not obtained 
due to the large number of terms involved. The combination of in- 


tegrals for cases II and IV is given in the Appendix. 
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Fig. 4—d < Rr — Rr. 


IV. RESULTS OF CALCULATIONS 


Figure 6 is the resulting plot of transmission (percent) vs offset 
(transmitting fiber-core radius) for K = 1.05, 1.00, and 0.95. Also 
shown are results calculated for the step profile for K = 1.00 and 0.95. 
These calculations show a 33-percent greater sensitivity to offset for 
the parabolic profile and approximately the same sensitivity to K as 
the step profile. Figure 7 is a family of curves for small offsets. For zero 
offset, the transmission equals 1 for K > 1 and equals K? for K < 1, 
as in the step profile. For offsets less than Rr — Rr for K <1, the 
parabolic profile exhibits a dependence on offset, whereas the step 
profile is constant. The reduction in transmission due to this effect is 
only 0.7 percent at 0.05 core-radius offset for K = 0.95. The slope 
discontinuities in the parabolic case in Fig. 7 result from the 0.01 core- 
radius offset increment selected. Curves for transmission vs offset are 
smooth except at the point d = Ry — Rr, where a discontinuity in the 
slope may exist. 
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Fig. 6—Calculated transmission vs transverse offset. 
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Fig. 7—Calculated transmission vs transverse offset for small offsets. 


V. COMPARISON OF CALCULATIONS AND MEASUREMENTS 


Several investigators have seen indications®° that the sensitivity 
to small offsets is significantly less than the calculated values for para- 
bolic fiber splices. These measurements®!° were made with short fibers 
on each side of the splice and the launched mode distribution probably 
differed greatly from the uniform power distribution assumed here. 

A measurement of transmission vs offset was made with 500 meters 
of Corning fiber on each side of the splice subsequent to measuring the 
100-percent transmission level. A helium-neon laser with an expanded 
beam and a 20X objective lens were used to completely fill the fiber 
NA. Two tabs were placed near the center of the 1-km parabolic caw 
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fiber, about 4 inches apart, and the fiber was scored and broken be- 
tween these tabs. A single break was made and the same ends were 
realigned on a microscope slide against a thin straightedge with the 
tabs used to obtain rotational alignment. Glycerin was used as the 
index-matching material and a cover slip held the ends in position 
under a microscope. The 100-percent level was again obtained and 
transverse offset was introduced in the splice and photographed. 

Figure 8 shows the result of the transmission vs offset measurement. 
The sensitivity to offset for small offsets was less than predicted by 
theory, and greater than predicted for large offsets. The crossover 
point is approximately 0.8 core radius. For small offsets the reduction 
in sensitivity to offset is approximately the same as one would expect 
for a 10-percent oversized receiving fiber (K = 1.1). 

The use of the geometrical optics approximation has been corrobo- 
rated by comparison with the wave analysis of Marcuse.!® The con- 
clusion is that the assumption of equal-mode excitation, equal-mode 
attenuation, and no-mode coupling is not adequate to calculate splice 
transmission sensitivity to offset. The calculated transmission with 
core-diameter mismatch shows trends that can be used to obtain rela- 
tive effects only. The long fiber length measurements of transmission 
vs offset can be used as design data for setting required alignment 
tolerances. 
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Fig. 8—Comparison of calculated and measured transmission vs transverse offset. 
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Current work by Gloge™ indicates that the assumption of a sharp 
cutoff of power at the boundary of the solid angle defined by local Na 
could cause the discrepancies noted here. The model used in this 
paper is extended by Gloge to include a nonuniform power distribution 
across the solid angle defined by the local Na. 
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APPENDIX 
Integrals for Cases Il and IV 
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Laminated Fiber Ribbon for Optical 
Communication Cables 


By C. M. MILLER 
(Manuscript received February 26, 1976) 


Laminated fiber ribbon (LAMFIR) exhibits mechanical properties of a 
composite structure and is strong enough to withstand the stresses of cable 
fabrication. Laminating parameters have been adjusted to make the LAMFIR 
relatively easy to separate for accessing the fibers. Optical properties of 
LAMFIR have been characterized, so that trade-offs between numerical aper- 
tures (NA), cladding thickness/core diameter (t/d), and additional loss are 
given for uncoated, step-profile fibers and one set of laminating conditions. 
Larger values of t/d than originally expected are desirable due to the 
bends in the fiber axes. 


I. INTRODUCTION 


The minimum loss obtainable in high-quality optical fibers has 
been steadily decreasing over the years with losses of less than 2 
dB/km at 1.06 um now being reported.! These advances along with 
those in the area of sources and detectors indicate that the basic 
components needed to build digital fiber links capable of operation in 
the tens or perhaps even hundreds of Mb/s range are becoming 
available. Combining a plurality of fragile optical fibers with adequate 
packaging to form a rugged communications cable is needed before 
fiber systems become practical. 

The ribbon structure (linear array) for packaging optical fibers 
was suggested by Standley? and others of Bell Laboratories. This 
linear array structure is attractive from the splicing standpoint since 
groups of fibers can be handled at once. The “essentially two-dimen- 
sional”? nature of fiber ribbons relaxes the alignment requirements 
needed to accomplish mass field splicing of optical cables. This is 
advantageous for multifiber splicing. In addition, the ribbon provides 
mechanical support and increased bulk, thereby greatly improving 
the “handling qualities’ of individual fibers. 

Other fundamental requirements of any ribbon structure are as 
follows: 
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(z) The ribbon structure must protect the fibers against breakage 
due to handling and must have an adequate service life. The 
mechanical strength and abrasion resistance must be sufficient 
to withstand forces generated in the cabling operation as well as 
the cable handling required to emplace these cables. 

(2t) The optical properties of the fibers must not be significantly 
degraded by the ribbon structure or the subsequent cabling 
operation. 

(zat) The ribbon structure must be strippable in order to expose 
the fibers for splicing or terminating to sources or detectors. 
Alignment accuracy of fibers within the ribbon and relative 
to the edge of the ribbon will probably be dictated by splicing 
requirements. 

(wv) The ribbon structure must be cconomically manufacturable. 


This paper reports on a laminated fiber ribbon (ZAuFrR) which 
has strong bonding between the fibers and the ribbon structure. As 
shown in Fig. 1, a composite polyester-polyethylene tape forms each 
side of the structure with the fibers between them. At the polyester- 
polyethylene interface, the ribbon must be peelable in order to gain 
access to the fibers. 


Il. FABRICATION OF LAMFIR 


A commercially available roll laminator, designed to plastic-en- 
capsulate identification cards, was modified to fabricate LamFIR. The 
additions to the original machine include (2) a fiber guide to provide 
uniform fiber separation, (72) a cutter to trim away excess tape ma- 
terial, (277) a more accurate temperature controller, and (zv) modified 
gearing to increase the laminating speed. As shown in Fig. 2, two 
supply reels feed the composite polyester-polyethylene material over 
heated rollers which soften the polyethylene. The two heated tapes 
are brought into contact under pressure and the fibers are sandwiched 
in polyethylene which serves as the adhesive. Forced-air cooling is 
applied after the pressure rollers. Power rollers pull the laminated 
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Fig. 1—Cross section of a laminated fiber ribbon. 
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Fig. 2—Laminated fiber ribbon fabrication. 


material through the slitting station, with tension provided by the 
supply reels. 

The polyester maintains dimensional stability (higher melting 
temperature than polyethylene), while the polyethylene is softened 
by application of heat. At the proper temperature and pressure the 
polyethylene flows sufficiently around each fiber to form the cross 
section shown in Fig. 1. 


lll. MECHANICAL PROPERTIES 


Mechanical strength tests have been performed on the LAMFIR 
structure.? The purpose of these tests was to determine how the 
structure modifies the mechanical properties of individual fibers. 
Tensile tests were made on two-foot samples with individual fiber 
breaks being detected in situ by an optical technique. The results 
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of these tests? on the LAMFIRs containing different numbers of fibers 
are shown in Fig. 3. The output of an optical sensor indicates that 
the first break in the straight portion of the curve corresponds to 
the load at which all the fibers have broken. It is seen that the load- 
bearing capacity of the fibers, as well as the slope, increases with 
the number of fibers. A comparison of the curves for the LAMFIR 
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SEVEN-FIBER LAMFIR 


SIX-FIBER LAMFIR 


FIVE-FIBER LAMFIR 


FOUR-FIBER LAMFIR 
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LOAD IN POUNDS 
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LAMFIR WITHOUT ANY FIBERS 





TIME IN MINUTES 


Fig. 3—Strength of a LAMFIR. 
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composites with the curve for LAMFIR containing no fibers shows 
that the fibers indeed carry the major portion of the load. After 
all of the fibers had fractured, the load did not decrease to the load- 
bearing capacity of the plastic alone. The fibers broke at different 
locations with respect to the tape, and the ribbon now behaves like 
a mechanical composite containing discontinuous fibers. Well-bonded 
discontinuous fiber composites react to applied loads by locally trans- 
ferring stress through the matrix across the fiber discontinuities.’ 

From a mechanical standpoint the LAMFIR structure performs well. 
Rudimentary cables fabricated with LAMFIRs preserve the composite 
character of the ribbons. Tensile loads comparable to those anticipated 
in pulling long cables into ducts can be withstood without fiber 
breakage. Should additional strength be required, load-bearing fibers 
of graphite, Kevlar 49,* or glass could perhaps be guided into place 
during lamination to increase the strength of the ribbons. Load-bearing 
elements might also be embedded in the cable sheath‘ or elsewhere if 
additional strength is needed for pulling cables into rough or tight 
(high coefficient of friction) ducts. 


IV. OPTICAL PROPERTIES 


Initial LAMFIRs exhibited exceedingly high losses (hundreds of dB/ 
km). It was found that random bending of the fiber axis caused this in- 
creased loss. The random bending was caused by shrinkage in the poly- 
ester component of the composite film after lamination during cooling 
of the polyethylene. A thicker polyester component which was oriented 
longitudinally to reduce shrinkage greatly reduced the additional loss. 

The fiber cladding thickness is another factor that greatly affects the 
loss due to bending in the presence of a lossy jacket. The cladding 
optically isolates the fiber core by providing a low-loss medium for 
the exponentially decaying evanescent field. If the cladding is too 
thin, this evanescent field has appreciable intensity at the boundary 
between the cladding and, in the case of LAMFIR, the polyethylene 
which is lossy at optical frequencies. When bending of an optical 
fiber occurs, the transverse field extends even further into the cladding 
on the outside of the bend.® 

Increased mode coupling due to random bending may also be 
contributing to the added loss.* This mechanism may result in a 
continuous transfer of energy to the lossier high-order propagating 
modes and to radiating modes. Either effect, inadequate cladding 
thickness or increased mode coupling, can significantly increase the 
loss of fibers. 


* Trademark of E. I. duPont de Nemours and Company, Inc. 
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A set of 12 ribbons was fabricated with the emphasis on uniformity 
of ribbon geometry and repeatability of laminating conditions. All 
conditions during ribbon fabrication were held as constant as possible 
at a set laminating temperature and a constant laminating pressure 
supplied by pressure rollers. Additional loss vs cladding thickness/core 
diameter (t/d) for various fiber numerical apertures (NA) are shown 
in Fig. 4. All data points follow the dashed curves which indicate the 
trends in the data. Conclusions based on these particular laminating 
conditions (not necessarily optimum) are as follows: 


(z) Additional loss decreases with increasing t/d and increasing 
NA as expected. 

(72) Three-mil op fibers consistently exhibit higher additional 
losses than 5-mil op fibers with the same t/d and na. The 
amplitude and spatial frequency spectrum of the bends depend 
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Fig. 4— Additional loss in a LAMFIR structure. 
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on the fiber stiffness. Five-mil fibers are much stiffer and 
probably assume bends with smaller amplitudes or longer 
periods (or both) as compared with 3-mil fibers. 

(iz) A relatively high t/d value is needed for the LAMFIR structure. 
A t/d ratio approximately equal to 0.5 is required for 5-mil 
fibers at NA = 0.21 and probably t/d 1.3 for 5-mil fibers 
with na = 0.15 for less than 2 dB/km added loss. The t/d 
parameter was selected because added loss increases with d 
and decreases with t. Since the powers of these two dependences 
may differ, loss may not be a single-valued function of t/d for a 
wide range of t and d. It is unclear which loss mechanism is 
responsible for the added loss, and indeed, inadequate cladding 
thickness may be important at small values of t/d while 
mode-coupling effects should dominate at large values of t/d. 

(wv) Excellent repeatability of the data is indicated by the cluster 
of points with na = 0.15 in the interval 0.5 < t/d < 0.6. 
These fibers were all in different ribbons. 
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Microbending Losses of Single-Mode, 
Step-Index and Multimode, 
Parabolic-Index Fibers 


By D. MARCUSE 
(Manuscript received January 16, 1976) 


We present formulas for the microbending losses of fibers that are 
caused by random deflections of the fiber axis. We consider single-mode 
(or almost single-mode), step-index fibers and multimode, parabolic-index 
fibers and compare their losses. Loss formulas for the single-mode fiber 
are derived from coupled-mode theory using radiation modes. Simple 
empirical approximations of the general formulas are also presented. The 
losses of the parabolic-index, multimode fiber have been derived earlier. 
The losses of both fiber types are compared by assuming etther that each 
jiber samples the spatial Fourier spectrum of the distortion function at the 
same spatial frequency, or by comparing typical fibers of each type with 
each other regardless of any similarity between them. It 1s found that the 
multimode, parabolic-index fiber has lower losses if it supports a sufficient 
number of guided modes. 


I]. INTRODUCTION 


Two types of optical fibers appear to be promising for wideband 
communications, the monomode step-index fiber and a multimode 
fiber with parabolic (or nearly parabolic) refractive index distribution.! 
A third fiber type, the W-fiber,? is so similar to the step-index fiber that 
its properties are easily included in the discussion of single-mode fibers. 
However, single and multimode fibers suffer radiation losses caused by 
unintentional random bends of the fiber axis.*4 Thus, it is of interest 
to compare the two types of fibers and investigate which of them is 
better from the point of view of microbending losses. This loss com- 
parison is the objective of this paper. 

Radiation losses of optical fibers are caused by coupling between 
the guided modes and radiation modes.* Two modes with propagation 
constants 6; and 2 are coupled via a spatial frequency 6 of the coupling 
function when* 

[81 — B2| = 0. (1) 
937 


A guided mode with propagation constant 8, couples directly to the 
radiation modes using the entire spatial frequency spectrum in the 
range 

Bg — nok < 6 < By + nok (2) 


(nek is the plane-wave propagation constant of the cladding). How- 
ever, most practical, randomly bent fibers have spatial Fourier spectra 
that drop off very rapidly with increasing spatial frequency so that 
only a narrow region of the spatial frequency spectrum near 


6=68,-—nk = (3) 


is actually responsible for radiation losses in single-mode fibers. 

In parabolic-index, multimode fibers, the random curvature of the 
fiber axis causes coupling among the guided modes. Adjacent guided 
modes in parabolic-index fibers have almost equal spacings in @ space, 
so that only a very narrow band of spatial frequencies is instrumental 
in coupling the guided modes among each other.* Since most practical 
Fourier spectra drop off rapidly with increasing spatial frequency, we 
may assume that only the guided modes immediately adjacent (in 6 
space) to radiation modes couple directly to radiation. Radiation losses 
in multimode fibers are thus less direct than in monomode fibers. The 
energy is exchanged among guided modes and is lost only when it 
reaches the guided modes of highest order. We treat this problem 
mathematically by assuming that the highest-order modes always 
carry zero power, because they lose their power directly to radiation. ® 
In addition, we assume throughout this discussion that the modes are 
coupled so tightly that the steady-state power distribution has been 
established.® This assumption allows us to use a definite radiation loss 
of the multimode fiber. If steady state is not reached, the fiber loss 
cannot be characterized by a single number independently of the input 
conditions. 

Our discussion has made it clear that essentially one single spatial 
frequency is responsible for radiation losses in fibers. In multimode, 
parabolic-index fibers, it is the spatial frequency that couples the 
guided modes among each other; in single-mode fibers, it is the spatial 
frequency © of (3). However, even though we can specify a definite 
spatial frequency for the single- as well as the multimode fiber, we still 
face the possibility that these frequencies may be different in either 
case. This raises the problem: what criteria are to be used by which a 
single-mode and a multimode fiber may be compared? In this paper, 
we shall use two different criteria. First, we asume that the spatial 
frequencies of interest are identical for both types of fibers. This re- 
quires us to assume that the core radius as well as the maximum 


938 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1976 


refractive index difference between core center and cladding is different 
for each fiber type. When compared on this basis, the parabolic-index, 
multimode fiber is superior in its loss performance to the single-mode 
fiber, provided that a sufficiently large number of guided modes can 
exist. This result is intuitively apparent, since we have seen that the 
power has to ‘‘dribble’’ down through all the guided modes before it 
reaches the region of radiation modes and becomes lost. 

A second way of comparing single-mode and multimode fibers con- 
sists in considering configurations that are typical for each type of 
fiber regardless of the spatial coupling frequencies and determining 
quantitatively what their radiation losses are. It is clear that we must 
assume that the form of the spatial frequency spectrum is identical for 
each fiber type, otherwise no comparison would be possible. Compared 
on this basis the typical multimode, parabolic-index fiber has lower loss 
than the typical single-mode fiber. 

A step-index fiber operates in a single guided mode if its charac- 
teristic V-number has values V < 2.405. However, some fibers are 
made with a relatively thin inner cladding that is surrounded by an 
outer cladding whose refractive index is similar to that of the fiber 
core. An example of such a structure is the W-fiber.? Such fibers do not 
support guided modes in the strict mathematical sense; each mode 
loses power by leakage through the thin inner cladding and may be 
regarded as a leaky mode. The loss of the lowest-order leaky mode may 
be kept so small that it is negligible for practical purposes. The loss of 
the mode of next higher order may be considerably larger, causing its 
power to be lost in a relatively short distance. Such a fiber behaves as 
a single-mode fiber for V values exceeding V = 2.405. The radiation 
losses caused by random bends can be calculated by computing the 
power-coupling coefficient of the lowest-order mode to the next higher 
mode and by assuming that the higher-order mode is so lossy that 
it does not carry any power. We include such quasi-single-mode fibers 
in our discussion. Since it is impossible to anticipate the actual shape 
of the power spectra of the coupling function, we use simple power 
laws as models. Because only limited spatial frequency ranges need to 
be compared in any case, it should always be possible to approximate 
an actual power spectrum by a suitable power law. 


Il. LOSS FORMULA FOR THE SINGLE-MODE FIBER 


Random bends of an optical fiber can be described by assuming that 
the core-cladding boundary is changing as a function of the length 
coordinate z. Thus, we describe the core boundary of a fiber with 
random bends by the equation 


r(¢, 2) = a+ f(z) cos ¢. (4) 
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In this equation, a is the core radius and f(z) describes the shape of 
the randomly deformed fiber axis; ¢ is the angular coordinate of a 
cylindrical coordinate system. 

Using the standard coupled-mode formalism,® we derive the follow- 
ing formula for the radiation loss coefficient of any LP mode (for 
definition of tp, see Ref. 7) of the step-index fiber with randomly 
deformed axis, 


(ni — n3)7°J7 (xa) 


oo Daag T, sare | 


nak 2 _ J2= _1 (ca) 
ee || (F2(8, — B)) eae oJ» (oa)H®, (pa) — pJr-1(oa)H®™ (pa) |? 


Ji+1(0a) | as 


+ ToSu(oa) His (pa) — pvyi(oa)H (oa) |? ©) 


The Fourier spectrum of the distortion function f(z) appearing in (4) 
is defined as 


F(0) = lim — = [- flee-*#dz. (6) 


The ensemble average of the square of F'(@) can be expressed as the 
Fourier transform of the correlation function R(u) of f(z),8 


(F2(6)) = [ ; R(u)e-“du. (7) 


The other symbols appearing in (5) are defined as follows: 


2a = power attenuation coefficient 
a = core radius 
nm, = refractive index of fiber core 
mn, = refractive index of cladding 
vy = azimuthal mode number of te mode (vy = 0 for HEy 
mode) 
G, = propagation constant of guided Lp mode 


k = w(eouo)? = 2/d free-space propagation constant 
V = (n? — n3)*ka 
k = (nik? — B?)} 
a = [V? — (xa)? } 
o = (nik? — g)} 
p = (n3k? — p?)3 
J,(x) = Bessel function 
H(z) = Hankel function of the first kind, 
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Equation (5) was derived by using the approximate radiation modes of 
the fiber. This formula is, thus, accurate only if the radiation escapes 
at small angles relative to the fiber axis.> This condition is satisfied 
for power spectra that drop off rapidly with increasing spatial fre- 
quency, an assumption that is made throughout this paper. 

It was mentioned in the introduction that certain step-index fibers 
have claddings that are sufficiently narrow to provide high-leakage 
losses to all but the lowest-order mode. We obtain the microbending 
radiation loss for such fibers by assuming that two modes can exist 
in the fiber, but that only the lowest-order mode carries power. This 
mode is coupled to the next-higher-order leaky mode as well as to the 
radiation modes. However, since we assume that the amplitude of the 
spatial power spectrum of the coupling function decreases rapidly with 
increasing spatial frequencies, it is sufficient to consider only coupling 
to the leaky guided mode and ignore the radiation mode spectrum. 
The power-coupling coefficient between guided modes 0 and 1 is 
designated as hoi. The power loss coefficient for the HEy-mode in 
the presence of a second but leaky mode may thus be expressed as* 


RT3 (oa) T3 (cra) Ch F? (8) YF (3) 
2a2ngk J? (koa) | Jo (xa) J 2 (kia) | 


The subscript 0 refers to the HE;; mode while the subscript 1 indicates 
the next higher Lp mode.’ The spatial frequency 0 is defined as 


2a = hor a 





es Bog Ae 
0 _ Bo As Qnok (9) 


Equation (8) is applicable in the range 
2.405 < V < 3.882. (10) 
A discussion of the validity of using radiation modes to calculate 
the microbending losses is given in the Appendix. 


Ill. APPROXIMATE LOSS FORMULAS 


It is possible to approximate formulas (5) and (8) by expressions 
that show the relative influence of the fiber parameters and are simple 
enough to allow evaluation by pocket calculators. 

In the remainder of this paper, we restrict ourselves to power spectra 
of the form 


ea 
mo* sin — 
m 


D+) ] 


*See eqs. (8.6-10) and (5.2-20) in Ref. 5. 


(11) 
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This simple power law may serve as a model for every practical power 
spectrum, because we are usually interested only in a narrow spatial 
frequency region of the spectrum, which can always be approximated 
by a function of the form (11). The variance 6? is defined by (28); m 
is the exponent of the power law, and A@ is the width parameter of the 
distribution (11). 
For large values of the spatial frequency, we can express (11) in the 
simpler form, 
Am 
(r(0)) =o, (12) 
with 
Am = m(A0)"6? sin —. (13) 


If we use (12) and the transformation 
6 = Oz, (14) 
we can express (5) for » = 0 (HEj;-mode) in the form 


(ni — n3)ky?QJo 


(a) (72(Q)) 


i nad? (Ka) 
ce | Ji (ca) 
ih x” [oJ o(ca)H} (pa) — pJ1(ca)H§” (pa) |? ne 00) 


This expression is not much simpler than (5), but it shows clearly 
that the radiation loss depends indeed only on the spatial frequency 
6 = Q defined by (8). The upper limit © on the integral in (15) is an 
approximation that is valid for sufficiently large values of m. Equation 
(15) is useful only in the narrow interval 


1<V < 2.405. (16) 


For smaller V values, the HEi mode is so loosely guided that its field 
reaches very far into the cladding and suffers excessive bending losses. 
For 3.882 > V > 2.405, the step-index fiber is no longer supporting 
only one mode and (8) must be used. We have plotted (15) for a number 
of m values in the range (16) and used this plot to obtain the following 
empirical approximation of (15): 


Qa = Gi Nee (3) (v + 7.58 X 10-8 om): (17) 





This formula holds only for m = 4. The accuracy of (17) will be dis- 
cussed in connection with a discussion of Fig. 3. 

A good approximation [less than 0.5 percent error in the range (16) ] 
of the solution of the eigenvalue equation of the HE1,; mode can be 
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expressed empirically as follows: 


koa = 2.405 exp ( 


a 0.8085 ) | (as) 


V 
and (8) can be approximated by 


Be-mf-($) oe 


Equations (18) and (19) enable us to calculate loss values from (17) 
without having to solve the eigenvalue equation for the HE,,; mode. 
A similar empirical procedure allows us to approximate (8) as 





follows: 
_ (F?(Q)) 7 _ 0.616 |: 
2o0 = ngage | 9Y — 187) — Coane) 20) 
The spatial frequency © is defined by (9) with «1a approximated by 
0.8492 
Kia = 3.832 exp (- Tae)’ (21) 


In the range (10), this formula deviates from the exact solution of the 
eigenvalue equation by less than 0.5 percent (only very close to the 
endpoint at V = 2.41 is the error 0.8 percent). 


IV. MULTIMODE LOSSES 


The radiation loss formula for multimode parabolic-index fibers, 
whose mode distribution has reached steady state, has been derived 
earlier, ® 


Qa, = 5.8 OOo) A. (22) 


The refractive-index distribution inside the fiber core is given by the 


expression 
n(r) = m4 [1 — (z)a]. (23) 


The spatial frequency instrumental for mode coupling is® 


_ (24)? 
9p me Gg (24) 


The total number of guided modes of both polarizations and orienta- 
tions supported by the fiber is? 


N = (nha)? 5: (25) 
Since the core radius a appearing in (22) is different for single-mode, 
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step-index fibers and multimode, parabolic-index fibers, we must ex- 
press it in terms of some other quantity. We have mentioned in the 
introduction that the spatial frequency Q of eq. (8) characterizes 
coupling of the step-index fiber mode to radiation. One way of com- 
paring single- and multimode fibers is to insist that the critical coupling 
(spatial) frequency is the same for both fiber types. By eliminating a 
and A, we obtain from (22), (24), and (25), 


ny [ Oy \8 
2a = 1.4572 (22) k4(F? (6p) ). 26 
; The (ge) BPR CO.)) (26) 
The number of modes carried by the fiber can be varied by varying 
the core radius a, the relative refractive index A, or the free-space 
propagation constant k. Keeping @, and k constant requires us to vary 
a and Aas N is varied. 


V. COMPARISON OF SINGLE-MODE AND MULTIMODE FIBER LOSSES 


We represent the radiation loss coefficients in normalized form by 
dividing 2a by the product k*(F?). According to its definition (6) or 
(7), the power spectrum has the dimension cm', Since k has the same 
dimension as 2a, the ratio is dimensionless. The spatial frequency @ and 
the shape of (F?(@)) are always chosen to be the same for the single- 
mode or multimode fibers that are to be compared. In Figs. 1 and 2, 
we have plotted the normalized loss coefficient of the single-mode 


10-5 
8 
6 


4 


2ag /k4C F2(:)> 


-7 
10-5 2 4 6 8149-4 2 4 6 8149-3 2 4 6 849-2 2 4 6 8149-1 2 4 6 8499 


A@/k 
Fig. 1—Normalized loss coefficient of the single-mode, step-index fiber with power 
spectrum defined by (11) as a function of the width parameter A@/k of the power 


spectrum. The curve parameter m indicates the exponent of the power law. n; = 1.515, 
me = 1.5, V = 2.4. 
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10-5 2 4 6 Byg-4 2 4 6 ByQ-3 2 4 6 BiQ-2 2 4 6 Byp-1 2 4 6 8490 
A@/k 


Fig. 2—Similar to Fig. 1 with V = 2. 


fiber as a function of the normalized width parameter A@/k of the power 
spectrum (11) for nm. = 1.5 and ny = 1.01 ne. The loss curves were 
computed from (5) and the frequency Q appearing in the argument of 
the power spectrum is defined by (8). Figures 1 and 2 show that the 
loss coefficient is independent of the width parameter for small values 
of Aé/k. The dependence of the normalized loss on the normalized 
frequency V is illustrated in Fig. 3 for the case that the power spectrum 
(11) can be approximated in the form (12). The solid curves are com- 
puted from (5) while the dotted curves are calculated from the approxi- 
mate formulas (17) through (19). It is apparent that the approximation 
is quite good, particularly for large values of the exponent m. The dash- 
dotted curves in Fig. 3 illustrate how the approximation deteriorates 
if the term in (17) containing V” is omitted. It is apparent that this 
term provides an important correction for large values of V. The range 
of V values on the abscissa of Fig. 3 coincides with the single-mode 
range of (16). Formulas (5) or (17) do not hold for larger V values. For 
2.405 < V < 3.83, eq. (8) or its approximation (20) apply. The agree- 
ment of the approximation (20) with (8) is at least as good as that of 
the solid and dotted curves with m = 10 in Fig. 3; therefore, only a 
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2a /k*CF 2(0)> 





10711 
1.0 1.2 14 1.6 18 2.0 2.2 2.4 


Fig. 3—Normalized loss coefficient of the single-mode, step-index fiber as a func- 
tion of the normalized frequency parameter V for several values of the power law 
exponent m. The loss values in this curve correspond to the flat portions on the left- 
hand side of Figs. 1 and 2. The solid curves correspond to eq. (5) while the dotted 
curves were computed from the approximate equation (17). The dash-dotted curves 
represent (17) without the term V™. mn; = 1.515, m2 = 1.5. 


single solid curve was drawn in Fig. 4 to represent these formulas. 
In the vicinity of V = 2.4, the simple equation (8) and consequently 
its approximation (20) are not exact. As long as the value of the propa- 
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4x1077 
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2a / k4<F2()> 


2x107? 


107? 
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Vv 


Fig. 4—Plot of eq. (20). This curve represents the normalized radiation loss of a 
quasi-single, HE11 mode in the presence of a leaky (next-higher-order) mode. The 
curve in this figure continues the curves in Fig. 3 to higher values of V, m. = 1.515, 
ne = 1.5. 


gation constant @; is very close to n2k, coupling to the radiation modes 
is not entirely negligible. It is apparent from Fig. 3 that the loss curves 
for different values of m have a tendency to converge to one curve. 
In Fig. 4 this convergence has taken place, but the transition from 
the many curves to one single curve is lost. 

A comparison between the loss coefficients of the single-mode, step- 
index fiber and the multimode, parabolic-index fiber is possible only 
if certain assumptions are made. Figure 5 presents plots obtained from 
the loss formula (26) of the multimode fiber. We have drawn these 
curves by requiring 6, = Q with Q given by (3) and @, defined by (24). 
Multimode and single-mode fibers now sample the power spectrum 
at the same spatial frequency. The independent variable N in Fig. 5 
represents the total number of modes that can propagate in the multi- 
mode fiber. As the number of modes changes, the fiber dimensions must 
change accordingly, keeping the value of 6, constant. 

The curves in Fig. 5 are labeled by the values of the normalized 
spatial frequency Q/k; the corresponding values of the V number of 
the single-mode fiber are written in parenthesis. These V values are 
needed for a comparison of the loss values in Figs. 3 and 5. The normal- 
ized loss values are directly comparable since the same power spectrum 
at the same spatial frequency has been used in both cases. However, 
each curve of Fig. 5 must be compared with the curves in Fig. 3 at the 
proper V value. It is apparent that the loss of the multimode fiber 
can always be reduced below the loss of the single-mode fiber if the 
number of modes, N, is made large enough. For example, a single- 
mode fiber will normally be operated near V = 2.4. The corresponding 
curve (with V = 2.4) in Fig. 5 lies below the single-mode loss value 
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2ap/ k4<F21.Q)> 


4 6 8493 2 4 6 8104 2 4 6 8 105 


N 


Fig. 5—Normalized loss coefficients of the multimode, parabolic-index fiber which 
is designed so that its spatial frequency 6p [eq. (24)] coincides with the spatial fre- 
quency © [eq. (3)] of the single-mode fiber. The values of Q/k label the curves; the 
values shown in parenthesis are the corresponding V values of the single-mode fiber. 


for N > 380. For a fiber with np = 1.5 and A = 0.01, N = 30 cor- 
responds to ka = 52, for \ = 1 um we have a core radius of a = 8 um. 
Next we compare the microbending losses of single- and multimode 
fibers by assuming typical fiber parameters. Single-mode fibers must 
be designed to have a V value near V = 2.4. If the fiber has a narrow 
inner cladding surrounded by an outer cladding, whose refractive index 
is similar to that of the core, the leaky mode losses of modes of order 
higher than HEy; may be so large that the fiber behaves effectively as 
though it supported only one guided mode and its V value may be 
chosen to be above V = 2.405. After the V value has been chosen, it 
is desirable to make the core radius a as large as possible to ease the 
splicing problem. We assume that fiber core radius values in the range 
a = 5 um to 10 um are of interest. Thus, with } = 1 um, we are 
interested in ka values in the range 40 < ka < 60. With V = 2.4, the 
corresponding index difference would be in the range 0.0005 < m1 — ne 
<0.002; for V = 3.5, we have instead 0.001 < m1 — ne < 0.0045. 
For a typical multimode, parabolic-index fiber, we use A = 0.01 and 
let the core radius fall in the range 16 < a < 35 um so that with 
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\ = Lum, we have 100 < ka < 220. The cladding index is assumed to 
be that of fused silica, ne = 1.457, for both types of fibers. Since there 
is now no fixed spatial frequency that’ we may use for normalization 
purposes, we use the power spectrum at the spatial frequency 6 = nokA 
to normalize the loss coefficient. This normalization uses the A value 
of the multimode fiber (A = 0.01) and is certainly quite arbitrary, 
particularly for the single-mode fiber. However, a common normali- 
zation for both fiber types is necessary if we want to compare the loss 
values. 

Figures 6 through 9 present normalized loss values for the single- 
and multimode fibers for m = 4, 6, 8, and 10. The single-mode losses 
are plotted as solid lines for different values of V; the dotted lines 
indicate the multimode loss values. The upper scale in the abscissa 
ranging from ka = 30 to ka = 60 applies to the single-mode curves, 
while the lower abscissa from ka = 100 to ka = 220 applies to the 
multimode curves. We have included two curves with V = 3.1 and 


kag 


2a/k4 <FA(ngk A) > 


= 
° 
d 


MULTIMODE 





10-6 
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Fig. 6—Normalized radiation losses of single-mode (solid curves) and multimode 
(dotted curve) fibers. The upper scale on the abscissa applies to the single-mode case 
while the lower scale belongs to the multimode fiber. The power spectrum used for 
normalization of the loss factor is taken at the spatial frequency 6 = n.kA with 
C ) 0.01, ne = 1.457. This figure uses the exponent m = 4 of the power spectrum 

12). 
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Fig. 7—Same as Fig. 6 with m = 6. 


V = 3.8 in Figs. 6 through 9 to illustrate the case of a fiber that is not 
strictly speaking “single mode,” but supports an additional leaky 
mode. The curves for V in the range 1.5 < V < 2.4 were computed 
from (17) through (19) while the curves with V = 3.1 and V = 3.8 
were computed from (20). The dotted curve for the parabolic-index 
fiber was obtained from (22). 
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Fig. 8—Same as Fig. 6 with m = 8. 


It is apparent that typical multimode fibers have lower losses than 
typical single-mode fibers. 


VI. CONCLUSIONS 


We have presented loss formulas for the single-mode, step-index 
fiber and for the multimode, parabolic-index fiber. The step-index fiber 
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Fig. 9—Same as Fig. 6 with m = 10. 


formulas were approximated by simple empirical approximations to 
obtain analytical expressions that can be easily evaluated with the 
help of a pocket calculator. 

We compared the radiation losses of both fiber types that are caused 
by random deflections of the fiber axis. It was our objective to deter- 
mine which fiber type is more sensitive to these microbending losses. 
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Our conclusion is that the single-mode fiber is more susceptible to 
microbending losses than the parabolic-index, multimode fiber. Since 
both fiber types are useful for wideband communications purposes 
using GaAs injection lasers or LEDs (in the case of the multimode fiber 
only), a knowledge of their respective microbending losses is important 
for system considerations. 
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APPENDIX 
Validity of the Loss Formula in Equation (5) 


Equation (5) was derived under the assumption that the fiber core 
is surrounded by an infinite cladding. Actually, cladding modes play 
the role of our radiation modes and the question arises: how accurate 
is a description of mode losses in terms of radiation modes? To answer 
this question, we compare the results of numerical evaluations of (5) 
with a theory published by Kuhn,‘ who uses cladding modes in a fiber 
with lossy jacket to calculate mode losses caused by random bends. 
Kuhn assumes that the curvature of the fiber can be described by either 
an exponential or gaussian correlation function. Since our formulation 
uses the function f(z) to describe the fiber axis, we must first transform 
our equations into a form that allows us to compare it to a description 
in terms of fiber curvature. If we denote the power spectrum of the 
curvature function by (C?(@)), the following relation holds: 


(F2(@)) = S$). (27) 
The variance of f(z) is defined as 
e = * I, ” (F2(@))d8. (28) 


The variance of the curvature spectrum is similarly defined as 


((3)) =: fo @@yas. (29) 


If we define the power spectrum (F?) with variance ¢*, we can compute 
the corresponding variance for the curvature spectrum only if the 
integral (29) exists. For an exponential correlation function of the 
curvature, the power spectrum (C?) has a Lorentzian shape® that re- 


MICROBENDING LOSSES OF FIBERS 953 


mains finite at @ = 0 so that the integral in (28) does not exist. Cor- 
respondingly, (29) does not exist for a Lorentzian shaped (F?). Thus, 
a comparison between Kuhn’s theory and ours is possible only for 
the gaussian correlation function. 

Kuhn uses the following autocorrclation function for the fiber 
curvature :* 


Relu) = (( j) ) exp (—2*/D?). (30) 


D is the correlation length. The power spectrum of the distortion func- 
tion f(z) is, thus, 


(¥(0)) = Ve) ) Geexp L~ 6D/2)*I (31) 


Numerical integration of (5) yields the solid curve for the normalized 
radiation power loss shown in Fig. 10. This curve was computed for 
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Fig. 10—Comparison between Kuhn’s theory and our theory for nm = 1.51, 


n, = 1.5, and V = 2.18. The autocorrelation function of the fiber curvature is 
gaussian. 
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the following conditions: 


Ne = 1.5 
ka = 12.56581. 


These values lead to V = 2.18. The eigenvalue equation for Lp modes? 
allows us to calculate ka = 1.5851486. 

The dashed curve in Fig. 10 was obtained from Kuhn’s publication.‘ 
It is apparent that the two theories agree very well for large correla- 
tion length corresponding to small spatial frequencies. The agreement 
is not quite as good for short correlation length or high spatial fre- 
quencies. Coupling spectra with large spatial frequencies lead to radia- 
tion escaping from the fiber core at large angles. Our eq. (5) is limited 
to small angle radiation. The solid curve in Fig. 10 was computed by 
using (5) for large. correlation length, but using a corresponding for- 
mula derived with the help of free-space radiation modes for small 
correlation length. However, the longitudinal components of the guided 
and radiation modes were ignored, which led to an underestimation of 
the loss for radiation escaping at large angles. In the main part of this 
paper, we are interested only in coupling functions whose Fourier 
spectra drop off rapidly with increasing spatial frequency, so that only 
small spatial frequencies are important; eq. (5) is thus applicable. The 
comparison of our theory with Kuhn’s shows that no large error results 
from using radiation modes instead of cladding modes to compute 
microbending losses. 
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This paper compares the performances of equalizing and nonequalizing 
repeaters in optical fiber systems in the presence of delay distortion leading 
to intersymbol interference on the received optical pulse stream. Curves of 
receiver sensitivity loss vs delay distortion are computed for nonequalizing 
repeaters for various received optical pulse shapes. These results are then 
compared to previously published results for equalizing receivers. The 
results indicate that for modest delay distortion (less than 5 dB of receiver 
sensitivity loss) there are no significant advantages of the equalizing 
receiver over the nonequalizing type. 


I. INTRODUCTION 


This paper is concerned with the effects of intersymbol interference 
on the performance of equalizing and nonequalizing optical fiber 
system repeaters. Recent advances in the fabrication of graded-index 
optical fibers and improvements in laser lifetime should result in the 
ability to design fiber systems that are essentially loss-limited for 
data rates below 100 Mb/s. Nevertheless, it is interesting to study 
the effects of small amounts of intersymbol interference due to fiber 
delay distortion on the performance of optical fiber system repeaters.* 
Repeaters which can accommodate small amounts of intersymbol 
interference increase the yield of usable fibers in real cables where 
the index gradings are not ideal. 

In a previous paper, the effects of intersymbol interference on 
equalizing repeaters were studied. Such repeaters compensate for the 
fiber delay distortion (pulse spreading) by incorporating a high- 
frequency rollup filter in their linear channels. This rollup results in an 
enhancement of the receiver noise. Using the results of that paper, 
curves can be plotted of receiver sensitivity loss vs the spreading of 
the received optical pulses. 


* Throughout this paper we shall assume that optical power pulses that overlap at 
the receiver add linearly, as justified in Ref. 3. 
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Fig. 1—Equalizing receiver. 


In this paper, we shall present similar curves for nonequalizing 
repeaters. In such repeaters, pulse spreading results in an uncompen- 
sated closing of the receiver output “‘eye’’, leading to a sensitivity 
loss. Nonequalizing receivers do not require a knowledge of the 






OUTPUT PULSE (ARBITRARY UNITS) 


TIME SLOTS, t/T 


Fig. 2(a)—Ideal output pulse; Vour(t), 7 = 1.28. 
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Fig. 2(b)—Ideal eye diagram; Vour(t), 7’ = 1.28. 


received pulse shape. This is particularly important in systems using 
graded-index fibers, where residual pulse spreading, due to grading 
errors, can lead to unpredictable received pulse shapes. 

Numerical results indicate that there is no significant advantage 
in equalizing over nonequalizing repeaters for sensitivity losses below 
5 dB. There is no drastic deterioration in performance of the non- 
equalizing repeater compared to the equalizing type. As a consequence, 
the importance of hypothetical receivers that estimate the received 
pulse shape in order to adjust adaptive equalizers is considerably 
diminished. 


Il. THERMAL-NOISE-LIMITED SYSTEMS 


This paper is concerned primarily with thermal-noise-limited systems. 
This restriction drastically simplifies the analysis compared to systems 
limited by randomly multiplied signal-dependent shot noise. We shall 
briefly discuss shot-noise-limited systems in Section III. 


2.1 Equalizing receivers 


An equalizing receiver is shown in Fig. 1. Observe that the output 
pulse shape Vout(é) is independent of the fiber baseband frequency 
response Hpper(f). We assume that we have chosen Vour(t) from the 
raised cosine family! with parameter 6 = 1. Figure 2 shows pictures 
of Vout (¢) and its associated ‘‘eye diagram.” (The eye diagram is defined 
in the Appendix.) 
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The rms noise at the receiver output is dependent upon the fiber 
baseband frequency response (through the equalization) and the type 
of amplifier being used. If we normalize the eye opening to unity, we 
obtain an expression for the required optical power at the receiver 
input to achieve a 10~° error rate of the following form!: 


12hQ 
P optical required — a Z i, (1) 


where 


we ie photon energy/detector quantum efficiency 


Z = normalized mean-square recciver output noise.! 


In eq. (1), Z is an increasing function of the required equalization 
and is written in the following form: 


z= fo Pa Dl/| Hoe Af 
+8 fo [Pa DiP/|HoalA) df, ©) 


where Fr(f) depends upon the desired receiver output pulse and the 
transmitted pulse and is typically close to unity in value, 


B = bit rate 
1, Y2 = constants depending upon the amplifier being used. 


In eq. (2), if the term involving y; dominates, we have a low- 
impedance front end. If the term involving y2 dominates, we have a 
high-impedance front end. 

Using eq. (2), we can calculate curves of the increase in required 
optical power (sensitivity loss) as a function of the fiber delay 
distortion. 


2.2 Nonequalizing receivers 


A nonequalizing receiver is shown in Fig. 3. In this receiver, the 
output pulse shape depends upon the baseband frequency response, 
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Fig. 3—Nonequalizing receiver. 
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but the rms noise, Z?, does not. To calculate the receiver sensitivity, 
we must know the eye opening at the sampling time. We shall assume 
that in the absence of delay distortion in the fiber, Hape(f) = 1, the 
receiver output pulse is a raised cosine 8 = 1 pulse, as shown in Fig. 2. 
The output pulse and eye pattern resulting after delay distortion is 
included can be calculated in a straightforward manner. The required 
power to achieve a 10~° error rate is given by 


hQ 
PF siiical required 12 7” (0) (Z)}, (3) 


where 


9 = eye opening at the sampling time (@ = 1 corresponds to a 
completely open eye) 

Z, = the value of mean squared thermal noise Z when there is no 

fiber delay distortion, i.e., Z of equation (2) with Hgpe(f) = 1. 


2.3 Examples 


Having defined the equalizing and nonequalizing receivers, we can 
now present computational results for the effects of fiber delay dis- 
tortion. To make such calculations, we must specify the shape of the 
fiber impulse response Apper(t), which is the Fourier transform of the 
fiber frequency response Hpper(f). In the examples below, we shall 
consider a variety of impulse response shapes. 


2.3.1 Gaussian-shaped impulse response 


In the absence of detailed knowledge of the fiber impulse response, 
we can begin by assuming a gaussian shape. Such a shape is ap- 
propriate for long fibers with mode mixing, and is given by the 
expression 


Raver(t) = exp — [{(@/T)/(o/T)}?/2], (4) 


where t/T is the time variable normalized to the units of time slots, 
and o/T is the ‘rms width” of the impulse response in time slots. 

Figure 4 shows curves of the deterioration of the receiver sensitivity 
as a function of o/T. Curves are given for a nonequalizing receiver, 
a high-impedance equalizing receiver, and a low-impedance equalizing 
receiver. All curves are normalized to zero sensitivity loss for o/T = 0 
(sensitivity loss is the increase in required optical power in dB to 
maintain a fixed error rate). The curves do not show the relative 
performances of high- and low-impedance receivers but only the 
added effects of delay distortion. Figures 5 through 8 show receiver 
output eye diagrams and pulse shapes for ¢/T = 0.35 and ¢/T = 0.55 
and a nonequalizing receiver. 
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ey losses vs o/T for a gaussian impulse response, thermal noise 
imited. 


In this example, we see that the nonequalizing receiver may be 
better or worse than the equalizing types depending upon the front- 
end design. (The optimal receiver would be some partial equalization 





—1.0 —0.8 —0.6 —0.4 —0.2 0 0.2 0.4 0.6 0.8 1.0 


Fig. 5—Eye diagram for gaussian impulse response; ¢/7’ = 0.35. 
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Fig. 6—Receiver output pulse for gaussian impulse response; ¢/T' = 0.35. 
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Fig. 7—Eye diagram for gaussian impulse response; ¢/7’ = 0.55. 
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Fig. 8—Receiver output pulse for gaussian impulse response; ¢/T = 0.55. 


compromise between noise enhancement and intersymbol interference 
reduction.) The nonequalizing receiver does not deteriorate in perform- 
ance at a significantly faster rate than the equalizing receiver. 


2.3.2 Skew-shaped experimental pulse 


Figure 9 shows a pulse response measured on an actual fiber. The 
fiber was 740 meters long. This pulse has an rms width o = 0.77 ns. 
If we vary the bit interval 7, we can calculate the receiver sensitivity 
loss for equalizing and nonequalizing receivers. The results are shown 
in Fig. 10. Figures 11 and 12 show the receiver output eye diagrams 
for o/T = 0.24 and o/T = 0.6. In Fig. 10, it is assumed that the 
nonequalizing receiver decision circuit samples the receiver output 
signal at the peak eye opening. This assumption is subject to further 
investigation concerning the design of timing recovery circuits. 

We can propose a receiver which equalizes only the phase of the 
fiber baseband frequency response, leaving the amplitude rolloff 
uncompensated. Figure 13 shows the receiver sensitivity loss of such a 
phase-equalizing receiver vs ¢/7'. Also shown are the curves given in 
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‘a 


Fig. 9—Impulse response of borosilicate fiber 740 meters long. 


Fig. 10. We note that the phase-equalizing receiver has no significant 
benefit over the nonequalizing receiver, except that sampling at the 
peak of the eye opening may be easier. Figure 14 shows the receiver 
output eye diagram for o/7 = 0.6 for the phase-equalizing receiver. 
This can be compared to Fig. 12. 


2.3.3 Gaussian pulses on a pedestal 


In fibers with graded indices, it is sometimes observed that the 
impulse response can be divided into two parts: (2) a narrow part 


7 


NO EQUALIZATION ~ ea 


o 


EQUALIZATION, 
HIGH IMPEDANCE~_ 
~ 


BSS 


w 





SENSITIVITY LOSS IN dB 
N 


~N 
~~ EQUALIZATION, 


= 


LOW IMPEDANCE 


0 0.1 0.2 '0.3 0.4 0.5 0.6 0.7 
o/T 5 


Fig. 10—Sensitivity losses vs ¢/T for borosilicate fiber. 
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Fig. 11—Eye diagram for borosilicate fiber; ¢/T = 0.24. 


containing (hopefully) most of the energy and (72) a wider part (tail) 
corresponding to high-order modes with long delays, or cladding modes 
coupled to the core modes. To simulate the effect of such a fiber 
impulse response on the system, we model it as the sum of a gaussian- 
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Fig. 12—Eye diagram for borosilicate fiber 740 meters long; o/7T = 0.60. 
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Fig. 183—Sensitivity losses vs ¢/T for borosilicate fiber (includes phase equalization). 


shaped pulse of area (1 — A) and rms width o and a rectangular pulse 
of area A and full width W. An example is shown in Fig. 15. 

Because of the uncertainty in the values of A, ¢/7, and W/c, which 
will be encountered in practice, we shall consider two cases: 


Case 1: W/o = 6.4, o/T = 0.5, andO S A S 0.2. 
Case 2: W/o = 12.8, o/T = 0.25,0 S$ A S 0.84. 
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Fig. 14—Eye diagram after phase equalization for borosilicate fiber 740 meters 
long; ¢/T = 0.60. 
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p(t) 


(1-A) eon A luge) = Us (t -¥)). 


p(t) = eas W 5 


WHERE Ug= UNIT STEP FUNCTION 





Fig. 15—Gaussian pulse on a pedestal. 


Figures 16 and 17 show calculations of receiver sensitivity loss 
(relative to a very-narrow-shaped fiber impulse response) as a function 
of A for cases 1 and 2, respectively, and for different receivers. Also 
shown are curves of 10 log [1/(1 — A) ] which correspond to the loss of 
total pulse energy to the pulse tail. We observe that tails affect the 
nonequalizing receiver more than the equalizing receiver. However, 
more calculations must be made to understand fully the effects of 
tails and how to combat those effects. Before these calculations can 
be made, more data on the impulse response of real fibers are needed. 
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Fig. 16—Receiver sensitivity loss for Case 1: ¢/T = 0.5 and W/o = 6.4. 
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Fig. 17—Receiver sensitivity loss for Case 2: ¢/T = 0.25 and W/o = 12.8. 


Il. SHOT-NOISE-LIMITED SYSTEMS (AVALANCHE GAIN) 


The extension of the previous results to shot-noise-limited systems 
requires one to take into account the signal dependence of shot noise. 
A rough approximation suitable for comparing nonequalizing and 
equalizing receivers in the following. For equalizing receivers, we can 
use the techniques described in Ref. 1 to calculate receiver performance 
degradation with increasing fiber delay distortion. Nonequalizing 
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Fig. 18—Sensitivity losses vs ¢/T for a gaussian impulse response, shot noise limited. 
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receivers can be studied by assuming that the shot noise at the receiver 
output is proportional to the average optical power falling on the 
detector, but is only weakly dependent upon the shape of the received 
pulses. In that case, the required received optical power increases 
approximately as —50/6 log (eye opening). This can be compared to 


the thermal-noise-limited receiver where the required optical power 
increases as —30/6 log (eye opening). 


An example of a receiver with avalanche gain is shown in Fig. 18. 
The figure shows a plot of receiver sensitivity loss vs ¢/T for a gaussian 
fiber impulse response and for equalizing and nonequalizing receivers. 
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APPENDIX 
Receiver Output Eye 


The output of the optical fiber receiver consists of a sequence of 
pulses plus noise: 


Vou(t) = 2 auhourlt — KP) +n), (5) 


where 


az = 0 or 1 represents the information 
T = pulse spacing 
n(t) = noise. 


In general, these pulses overlap, a characteristic which is referred to 
as intersymbol interference. Once every J seconds, the output is 
sampled (observed) to decide whether or not a pulse is present (a, = 1 
or a; = 0). In the absence of intersymbol interference, the ability to 
make this decision is limited by the noise. However, due to intersymbol 
interference, the receiver output at the sampling time can be smaller 
or larger than it would be if only one pulse were present. Pessimisti- 
cally, we can calculate the smallest value that Vour(¢) can assume given 
that the pulse we are looking for is on (a, = 1), and we can also 
calculate the largest value that Vour(t) can assume given that the pulse 
we are looking for is off (a, = 0); Aout both of these neglect noise n(é). 


Vrain on(t) = hou(t — kT) + DX [min of hour(t — k’T) and 0] 
kk 
6 
Vnexon(t) = 5 [max of hous(t — bP) and 0] (8) 
'F 
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The “eye diagram” is a plot of Vinin on(t) and Vmax on (t) on the same 
abscissa (see Figs. 5 and 6). The maximum difference between the 
upper and lower curves (before they cross) is called the eye opening. 
This maximum difference occurs at a point in time where the best 
chance to distinguish between ‘‘pulse present’? and ‘‘pulse absent’ 
exists. The ratio of this peak eye opening to the peak height of an 
isolated pulse represents a loss in noise immunity and is called the 
fractional eye opening. 
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Derivative Measurements of Light-Current- 
Voltage Characteristics of (Al,Ga)As 
Double-Heterostructure Lasers 


By R. W. DIXON 
(Manuscript received February 17, 1976) 


A modulation and detection scheme for applying derivative techniques 
to the investigation of stripe geometry (Al, Ga)As double-heterostructure 
lasers 1s described. Modulating at constant modulation index allows the 
quantities I(dV/dlI) and I?(d?V/dI*) to be directly obtained in the same 
apparatus at the first and second harmonic of the modulation frequency, 
respectively. Particularly strong indications of laser action and other 
optical interactions in the laser are contained in the second harmonic 
voltage response. The same apparatus may be used to obtain derivatives 
of the light-current relation. These are found to sensitively reveal light- 
current nonlinearities that are believed caused by filaments and other 
spatial inhomogeneities and instabilities. 


1. INTRODUCTION 


Derivative techniques are employed in many fields of science and 
engineering. Reasons for their use range from the enhancement of 
signal-to-noise ratio, as in nuclear magnetic resonance and optical 
reflectivity investigations of band structure, to the convenience of 
direct measurement of dynamic transistor characteristics. Usually the 
techniques measure system response to a small ac modulation, often 
using phase detection for signal enhancement. 

These techniques have recently been successfully applied to an 
analysis of the current-voltage characteristics of GaAs injection lasers. 
The first derivative dV/dI has been shown to be useful, both in homo- 
junction! and in double-heterostructure lasers,? for measuring the 
parameters entering the J-V characteristic as well as for extracting 
lasing threshold and other features intimately related to the lasing 
process. 

This paper describes a modulation and detection technique which 
has the advantage of allowing the direct measurement of [(dV/dI), 
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which is often the desired quantity (rather than dV/dI). The tech- 
nique also allows direct measurement of [?(d?V/dI?). Very strong 
second-derivative signals near and above lasing threshold are observed. 
Additionally, derivatives of the light-current characteristic are dis- 
played and, as anticipated, show much sharper detail, near lasing 
threshold, and when filamentary instabilities occur, than does the 
light-current characteristic itself. All of these properties can be mea- 
sured in the same apparatus. 


Il. SIMPLE THEORY 


Much of the value of the experimental technique results from the 
modulation of the laser current at constant modulation index, m = AI/ 
I = constant, rather than alternatives such as constant modulation 
amplitude, AJ. To appreciate the reasons for this advantage consider 
a laser modeled as an ideal p-n junction in series with a resistance 
of value R,.'.? The current-voltage characteristic of this idealized de- 
vice would be 

decd get ie iets 1; (1) 


where conventionally 8 = q/nkT, and the other parameters have their 
usual meanings. Solving eq. (1) for V, neglecting in this context the 
—1 term, and the current dependences of the parameters of eq. (1), 
and differentiating yields!? 


dV 1 


dl ae 18 ae R, (2) 
and 
aV 1 
aP ~~ BP @) 
For many purposes,” the preferred forms of eqs. (2) and (8) are 
dV i 
Ia = gti. (2a) 
and 
PV 1 
Son 
I TIE B? (8a) 


thus allowing the direct extraction of the parameters 6 and R, without 
having to deal with the inverse current dependences. (See Appendix.) 
Also, the forms (2a) and (3a) lend themselves to the investigation of 
the small J-V changes which occur because of the optical-electrical 
interactions in the laser. It is worth emphasizing that direct measure- 


ments of the quantities I(dV/dI) and [?(d?V/dI?) are desired as 
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opposed to the synthesis of these quantities from other measured 
parameters. This improves both the measurement accuracy and the 
experimental simplicity. 

To see how this is possible consider that a current of the form, 
I7 = I + AI cos Q is applied to the diode, the response voltage is 
expanded in a Taylor series, and coefficients in Q and 20 collected. 
The result is 

dV 


V (9) = mI (4) 
and 
V (2) = (F) eae (5) 


where m = (AI/I) is the modulation index, and only lowest-order 
terms have been retained.* Thus, if the laser can be modulated keeping 
the modulation index constant, the desired quantities may be obtained 
directly at the frequencies 2 and 2Q. 


ill. EXPERIMENTAL 
3.1 Detection at the modulation frequency (Q) 


The experimental apparatus used is shown in Fig. 1. The key com- 
ponent is the current source. It has, as an offshoot of the feedback 
circuitry that maintains a constant current for varying load impedance, 
the property that it can be easily modified so that modulation at 
constant m occurs when an appropriate low-frequency (<1 kHz) ac 
signal is applied. When the phase detector is tuned to the modulation 
frequency 2 [mode (a) in Fig. 1], laser response voltages like that 
labeled I[(dV/dI) in Fig. 2 are obtained. This curve has features 
analogous to those seen previously.4? These include y-axis intercept 
[equal to 1/8 on the simple theory, eq. (2a) J, the slope (proportional 
to series resistance R, on the simple theory), and an indication of 
voltage saturation near lasing threshold. System response was checked 
both with resistors and with a silicon diode replacing the laser, and 
the known values of R and @ were reproduced very well. The signal 
V(Q) was also confirmed to be linearly proportional to m as expected 
from eq. (4). Many different lasers were measured and their results 
seemed generally similar although major deviations from slope linearity 
below threshold were sometimes observed, as were significant differ- 
ences among the forms of the I(dV/dI) responses near and above 
lasing threshold. Voltage saturation was almost never complete, and 


“ Higher-order corrections depend on higher powers of m and are not significant 
for m small. 
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Fig. 1—Schematic diagram of apparatus used to obtain voltage and light deriva- 
tives in (Al, Ga)As double-heterostructure lasers. Mode (a) detects V(Q) « I(dV/dI) 
and mode (b) detects V(2Q) « J?(d?V/dI*). 


MODULATION 
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filament changes resulted in complex I(dV/dI) responses. The ex- 
perimental technique, however, seems accurate, easily calibrated, and 
simple to use. 


3.2 Detection at twice the modulation frequency (20) 


An important advantage of this technique is that it can also be used 
to measure the voltage response at frequency 22 and thus to obtain 
the second derivative [?(d?V/dI’). Detection at frequency 22 was 
accomplished by operating the phase detector in the f/2 mode [mode 
(b) in Fig. 1]. In this mode, the instrument produces an internal refer- 
ence frequency at exactly twice the frequency of the externally applied 
reference, thus making the detection system sensitive to V (2Q). 

Trace (b) of Fig. 2 shows a typical signal detected with the system 
operated in the 20 mode. As anticipated, the signal-to-noise ratio is 
very large. The initial shape (with current increasing) seems quite 
reproducible from laser to laser while the structure at higher currents 
is laser-dependent. Again, the signal appears greatly affected by fila- 
mentary laser operation. When the laser was shorted or replaced by a 
resistance, a horizontal straight line resulted, as expected. As an 
additional check, the dependence of V(2Q) on modulation index was 
measured below threshold over the range 0 S m S 0.10 and confirmed 
to be accurately m?, consistent with theory [eq. (5) ]. Thus, there is 
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Fig. 2—Typical V(Q) and V(2Q) responses [curves (a) and (b)] as functions of 
laser current measured using apparatus of Fig. 1. The zero current intercept of 
I(dV/dI) and the initial negative offset of J?(d?V/dI?) are proportional to 
8-1 = (nkT/q) when the simple model described in the text is valid. Note voltage 
saturation in J(dV/dI) and the very strong second-derivative signal near threshold. 


little indication of feedthrough from the first harmonic or of harmonic 
distortion of the modulating frequency affecting the detected voltage 
at the second harmonic. If very accurate measurements were con- 
templated, this would merit more serious investigation (see, for ex- 
ample, Korb and Holonyak**). 


3.3 Light-current derivatives 


Using the circuit depicted in Fig. 1, the derivatives I(dL/dI) and 
I?(#L/dI*) can also be directly obtained. This was accomplished by 
placing the laser in a holder in which the optical emission from each 
mirror face could be separately monitored with silicon photodiodes. 
The photodiode current then served as the input to the phase detector. 
This more traditional application of derivative techniques may prove 
to be a useful adjunct to L(J) measurements, not only to provide 


* Also, T. L. Paoli has considered a more complex technique wherein modulation is 
provided at two frequencies Q; and Q2 and the system response is sought at frequency 
|Q, — Q2|, thus eliminating the harmonic relationship with the interfering signals. 
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accurate measurement of parameters, such as differential quantum 
efficiency [«(dL/dI)], but also to investigate filaments and other 
spatial inhomogeneities in the laser cavity. Typical derivative signal 
responses are shown in Fig. 3 along with a light-output-current charac- 
teristic measured in the same apparatus. A device with significant 
curvature in the L-I was used as an example to illustrate the en- 
hancement of such slope changes using the derivatives. As expected 
from earlier results, the emission from the two laser mirrors was often 
unsymmetric and was usually very unsymmetric when L-J kinks® 
were present. These kinks are thought* to be associated with spatial 
inhomogeneities in the laser cavity. This nonsymmetry was also present 
in the light-current derivatives. Very sharp derivative structure was 
seen near the light-current kinks. These observations strongly suggest 
that light-current derivatives, either alone or in combination with 
voltage derivatives, will prove very useful for investigating light- 
current nonlinearities, spatial inhomogeneities, and their relationship 
in injection lasers. 





160 
DIRECT: CURRENT IN mA 


Fig. 3—Typical results for (dL /dI) and I?(d?L/dI*) vs current in an (Al, Ga)As 
double-heterostructure laser taken with the apparatus shown in Fig. 1, except that 
the signal input to the phase detector was obtained from a silicon detector monitoring 
the light emission from the laser. 
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IV. SUMMARY AND CONCLUSION 


An experimental technique has been described which allows modu- 
lation of an (Al, Ga) As double-heterostructure laser at constant modu- 
lation index. Derivative techniques were then discussed which illus- 
trate the usefulness of this property in measuring the derivatives 
I(dV/dI) and I?(d?V/dI*) to extract the parameters describing a 
laser’s IJ-V relation and to investigate lasing-related voltage changes 
in this relation. Derivatives of light with respect to current were also 
obtained in the same apparatus. These results support the conclusion 
that derivative techniques will be very helpful in studying (Al, Ga) As 
lasers and similar devices. 
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APPENDIX 


The neglect of the —1 term is not thought to be significant. However, 
the assumptions of 8, R, and 7) independent of current may sometimes 
be unjustified. If 8 depends on current through the parameter n, the 
first-order correction to the derived value of 7 is 


an(t) = GV - IR,) <<. 


This correction can be large for quite reasonable J-V characteristics. 
Equivalently we find instead of eq. (2a) 


1S gt Va Gy tip. (1 - sat): (2b) 


dl B n dl 
Thus, both of the derived parameters 8 and R, are sensitive to the 
assumption of 8 independent of current. 
Analogous comments apply in the second derivative case where (8a) 
is replaced by 
PV 1 


Po = -3(0-255)+0- IRD — 


Expressions applicable when R = R(J) or Io = I(J) are easily derived. 


Pan 


nar =P) 
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On Kailath’s Innovations Conjecture Hold 


By V. E. BENES 
(Manuscript received October 30, 1975) 


With z, a signal process, w, a Brownian motion, and y, = fo! z.ds + w: 
a noisy observation, the innovations problem is to determine whether y. ts 
adapted to the innovations process v,, which ts also a Brownian mo- 
tion, and ts defined using the estimate 2, = E{z:\jy,, OSs St} by 
yi = So ésds + v1 The closely related c-algebras problem in stochastic 
DEs ts to determine, for a given causal drift a, when a solution 
of dé = a(t, &)dét + dw ts a causal functional of w,. Previous results on 
these problems are reviewed and extended. In particular, we broach and 
answer positively the physically important case of the innovations problem 
in which the signal satisfies a stochastic DE with drift depending in part 
on the noisy observations. This case is tmportant because it models a 
system observed through noise and controlled by feedback of these noisy 
observations. The last part of the paper shows that the innovations problem 
has a positive resolution tf and only tf on some probability space there is 
a Brownian motion W and a causal solution & of dé = a(t, &)dt + dW, 
where a expresses the estimator 2; that is, a ts a causal functional such 
that 4, = a(t, y). 


I. INTRODUCTION 


Estimation of signals from past observations of them corrupted by 
noise is a classical problem of filtering theory. The following is a 
standard mathematical idealization of this problem: The signal z, is a 
measurable stochastic process with E|z:| < ©, the noise w, is a 
Brownian motion, and the observations consist of the process 


t 
Ed I z.ds + Wi. (1) 


Define 2, = E{z,|y,,0 Ss S t}, the expected value of z given the 
past of the observations up to ¢. It can be shown! that if fo! z2ds < © 
a.s., then there is a measurable version of 2, with jo 2ds << as. 


The innovations process for this setup is defined to be 


t 
— i, Ge yan Ais 
0 
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and it is a basic result of Frost? and also of Kailath? that, under weak 
conditions, vy, is itself a Wiener process with respect to the observations. 
Thus, (1) is equivalent to the integral equation 


t 
Hes ie Sev (2) 


which reduces the general case (1) to that in which z, is adapted to y,, 
a special property useful in questions of absolute continuity in filtering 
and detection. 

Since 2, is of necessity adapted to y., Eq. (2) purports to define y, 
in terms of v,; the innovations problem, first posed by Frost,’ is precisely 
to determine whether it really does. Frost asked: Do the innovations 
contain all the information in the observations? [By (2) they do not 
contain more. ] In the language of probability this is to ask whether 
the c-algebras that the processes generate are the same up to null 
sets; i.e., is 


Vo Aclys, 8 St} = ofv., 8 S t} A NG (mod P)? 


Il. THE O-ALGEBRAS PROBLEM IN STOCHASTIC DEs 


The innovations problem is equivalent to an apparently more 
general problem from the theory of stochastic DEs, sometimes called 
the c-algebras problem: Given a causal drift a(s, x), possibly depending 
on the past of the function x, and a weak solution of the pr dz; 
= a(t, x)ds + dw, with w, a Brownian motion, and 2, possibly 
nonanticipating with respect to dw., to determine whether 


a{xts,s St} = o{w.,s S t} (mod P). 


Positive answers to both problems were widely conjectured. 

The innovations problem has been outstanding, in both senses of 
the word, since about 1968, and it has drawn the attention of communi- 
cations theorists and probabilists alike. The o-algebras problem has 
been current in the Soviet Union since the late 1950s; there it has been 
the object of great effort and a source of stimulus far in excess of its 
simple origins.* Accounts of the innovations problem and its theoretical 
background are in lecture notes by Meyer! and in a paper by Orey.® 

It is now known that the answer to the general problem is in the 
negative. B. Cirel’son has given a counterexample*:!” for the following 
special case (this case shows, incidentally, that the innovations and 
o-algebras problems are in fact the same): Suppose that the signal z, 
is a causal functional a(é, y) of the observations; i.e., the signal is 
entirely determined by feedback from the observations. Then z = 2, 
w = v, and the problem reduces to asking whether the observations 
are ‘‘well-defined”’ in the strong sense of being adapted to the noise; 
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for in this vestigial or degenerate case, the noise is the only process 
left. Cirel’son’s disturbing example consists of a choice of a(-, :) for 
which there is just one weak solution y,, which is nonanticipative in 
that the future increments of w. are independent of the past of y., 
but which cannot be expressed as a functional of w., causal or not, 
over any interval. 

Prior to this counterexample, several cases of the problems had 
been settled in the affirmative. J. M. C. Clark’ proved that if noise 
and signal are independent and the signal is bounded (uniformly in t 
and w), then observations are adapted to innovations. The author? 
extended Clark’s method and result to the case where signal and noise 
are independent and the signal is almost surely (a.s.) square-integrable. 
The case of gaussian observations turns out affirmatively: here results 
of Hitsuda® imply that 2, is a linear functional of the past of y, and 
eq. (2) is solvable by a Neumann series. Zvonkin” has given an affirma- 
tive answer to the o-algebras problem for the Markov case a(s, y) 
= a(ys) bounded and homogeneous in time, using the associated scale 
function to transform the state space; this result extends to time- 
dependent bounded a(s, y;) satisfying Dini’s condition. 

It should be remarked that although the innovations and o-algebras 
problems are mathematically equivalent, they arise in different 
contexts, involve different emphases, and can be usefully contrasted, 
as discussed below. 

The innovations problem arose in filtering theory, and it focuses 
especially on the nature of the filter or operator that gives 2; as a 
causal functional a(é, y) on the past of y,; from this point of view, 
the example of Cirel’son, in which there is no real filtering going on, 
is a bit wide of the mark; the real problem is to find out enough about 
the filter to be able to settle whether o{y} = o{v} in cases where 
there is a real signal (determined in part by sources other than the 
noise, and in part possibly by control or feedback based on the obser- 
vations), which it is desired to control, transmit, filter, or detect. 

The o-algebras problem arises in stochastic functional pEs, and is 
therefore more general in scope, since the drift functionals considered 
need no longer be filters or conditional expectations like 2;; the em- 
phasis is on dynamics, causality, and nonanticipation, with no ad- 
mixture of estimation. If the drift functional to be considered is a 
filter, it may have special properties that are useful in the investi- 
gation. (See the method of Clark.’) 


ill. SUMMARY 


Relevant notions from stochastic DEs are defined in Section IV: 
causal functionals, weak solutions, causal solutions, and nonanticipa- 
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tive solutions. A n.a.s. martingale-type condition for validity of the 
innovations conjecture is given in Section V; as an application, this 
condition yields (Section VI) the (known) conjecture for the case of 
gaussian observations. Section VIII describes some of Zvonkin’s posi- 
tive results for the Markov case and an extension. In Section VIII, we 
investigate the problem of calculating the estimate 2, and give various 
relationships based on absolute continuity of measures. Section IX 
is devoted to the physically important case of signals z, that solve 
stochastic DEs with drift based on feedback of observations; we show 
that if this drift is Lip in the feedback of linear growth in the signal 
uniformly in the observations, then observations are adapted to innova- 
tions. In Sections X and XI, finally, we show that validity of the in- 
novations conjecture is equivalent to the causal solvability, on some 
probability space, of the equation di = a(t, §dt+dW, with W 
Brownian and a the (a?) functional, such that 4: = a(t, y). 


IV. CAUSAL SOLUTIONS OF STOCHASTIC DEs 


A measurable functional y: [0, ©) X C[0, ~) — # is called causal 
if for each t € [0, ©), x, = y, for s S t implies 


v(s,t) = y(s,y) 2,y ECO, ~). 


The idea expressed by this definition is the physical one that y(t, -) 
cannot depend functionally on any more than the past of its argument 
up to ¢; thus, it has the same value at ¢ for two functions that agree 
for s S t. In spite of the presence of the word ‘‘depend’’ in the previous 
sentence, causality of a functional is expressible as a measurability 
property, and has no immediate relation to any probability measure. 

Let a be a causal functional. A weak solution of the stochastic DE, 


dx = a(t, x)di + dW, W. Brownian, (3) 


is a process £, such that 
t 
(TH. = & — f a dea 


is a Brownian motion on its own past. If £, is adapted to v.; 1.e., if for 
each ¢, & is measurable with respect to o{v,,s S t}, then &, is called 
a causal solution, and there is a causal functional ¢ such that &,; 
= g(t, v) at each ¢ with probability one. A solution &, is called non- 
anticipative if (roughly) the future increments of v, are independent of 
the past of &.; i.e., for each ¢ 

o{ My —MmyUe t} | o{&., ss t}. 


This is a probabilistic property, and it is equivalent to v’s being a 
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martingale on the larger algebras o{£,, s S ¢}. It can be seen that for 
(3), causal > nonanticipative, but the converse is false: it is known 
that there are drift functionals a for which there exist nonanticipative 


weak solutions, but no causal solutions.!” 


V. A SEMI-MARTINGALE CONDITION 
Theorem 1: o{vs,8 S t} = ofys,s S t} (mod P) for each t = 0 iff there 


ts a Yoj-adapted martingale x, and a causal functional y: [0, @) 
xX CLO, ©) +R such that the observations are representable (as a semi- 


martingale on their own past) by 
t 
Ys = t+ f (s, x)ds. 


Proof: The hypothesis on y, and the equation dy = édt + dv imply 
that 


eae i Serr are (4) 


The innovations process is a Wiener process with respect to the 
observations; thus, the left side of (4) is a continuous Yp-martingale. 
Since the right side is absolutely continuous in ¢, it follows that both 


sides vanish identically, so that x, and vy, are indistinguishable pro- 
cesses, and 


But the right-hand side is v-adapted because y is causal. The theorem 
follows from dy = édt + dv. For the converse, we argue thus: if the 
a-algebras coincide, there is a causal functional ¢ such that y; = ¢(t, v). 
The innovations theorem makes v, a Y-martingale with 


t 
Ye = vet I 2 ogds; 
then, take x = v andy = Zog. 


Vil. APPLICATION TO GAUSSIAN OBSERVATIONS 


The theorem just proved affords us a simple demonstration of the 
validity of the innovations conjecture for gaussian observations. 
Suppose that the signal z, is square-integrable almost surely. Then, a 
theorem of Kailath and Zakai" implies that the measure induced by y. 
is absolutely continuous with respect to Wiener measure. The class of 
gaussian processes absolutely continuous with respect to Wiener 
measure has been characterized in a causal way by Hitsuda’: a process 
y. belongs to this class iff there is a Wiener process W. adapted to y, 
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and a Volterra kernel m(-, -) € LZ[0, 1]? such that 


t 8 
Ye = Wet / i m(s, u)dW.ds. 
0 0 
Thus, W. is a martingale on the past of y, and in Theorem 1 we can set 


x= W,v(s, W) = fo m(s, u)dW.z, to conclude that v is W and 2 is y. 
Iteration of the relation between y and W gives 


t s 8 
w= Yt- i (f m(s, u)dyu — I m(s, u) 
0 0 0 
/ m(u, v)dudyy» + -- as, 
0 
so that letting 
t 
L(t, s) = m(t, s) — | m(t, u)m(u, s)du + ++ 
0 


be the Neumann series or the resolvent of m(-, -), we see that 


é 8 
Ye = vet I I L(s, u)dyuds 
o Jo 


t 
2, i] L(t, u)dyu. 
0 
Thus, the map @ is linear in y, when y, is gaussian, as was expected. 


Vil. RESULTS OF ZVONKIN FOR THE MARKOV CASE 


For a stochastic DE of the form dy = a(y,)dt + dw, Zvonkin"” has 
shown that if a(-) is bounded, then there is a causal solution y,. His 
procedure” is to look at the scale function 


u(y) = I exp — af" a(s)dsdz = |, e@ae (5) 


and to note that c{y,,s S t} = of{u(ys), s S t} because u(-) is mono- 


tone. Then to show that u(y;) can be got causally from w., he uses 
Ito’s rule on 2; = u(y) to get 


dz, = B(y:)dy: — B(y:a(y:)du: 
=> Blum (z.) Jdw:. 


By calculus he finds a(-) bounded => 8(u-) € Lip; hence, z, 1s a 
causal functional of w. and so is y.. 

Now this argument depends in part on the fact that w satisfies 
4u"’ + u’a = 0, and at once suggests extensions to the inhomogeneous 
case a(t, y) = a(t, yz). We give an example based on Zvonkin’s paper :” 
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Theorem 2: Suppose that for some strictly increasing function k(-) there 
exists a solution u(t, y) of the Cauchy problem 


u(O, y) = ky) 
Ur + 422 + alt, y)ue = Lu = Ot 
such that (log uz). ts bounded. Then, the stochastic DE ’ 
dy: = a(t, y:)dt + du; (6) 
has a causal solution. 
Proof: Since kT, it is clear that u, >0O and that the transformation 


z: = u(t, yz) is bijective. Using Ito’s rule, we calculate the stochastic 
differential of z, as 


dz, = (Lu) (t, y:)dt + uc(t, y:)dw: 


uolt, u*(t, 24) |dwe. (7) 


Now 


0 =| = Usalt, ut (, z) J 
re uelt, u(t, z)] = Pn TSO ul 


(log tis) 9]|you=*(e2) bounded. 


Hence, by the usual Ito’s theory of stochastic pEs, the martingale 
eq. (7) has a unique causal solution z, which is a bijection of y. point- 
wise in time. Hence, y, is a causal functional of w, too. The homogeneous 
case follows if we take u(t, y) = the scale function (5). 

In a similar vein we can show this result : 


Theorem 8: If a(-, -) ts bounded and such that 


ary 
=f, a(t, z)dz 


exists and ts bounded, then (6) has a causal solution. 
Proof: Let 


u(t, y) = i exp — 2 in a(t, x)dxdz 
0 0 
so that ue > 0, and z: = u(t, yz) is bijective. We have 
dz, = ust, u(t, 22) Jdt + welt, u(t, 24) Jdwr. (8) 
By calculus obtain formally 
= ult, w*(6, 2)] = —2alt, 2) 


0 0 z 
e- =i = aoe: 
ep ueLt, u(t, z) ] 2 at I a(t, x)dx, 


t Numerical indexes show which variable is differentiated and how many times. 
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both bounded by hypothesis. Hence, (8), and so (6), has a causal 
solution, by Ito’s theory. 

Note that Zvonkin’s Markov case is not relevant to Kailath’s 
innovations problem, because the filter giving 2; will almost invariably 
depend with advantage on the whole past of the observation process. 


VIII. CALCULATION OF THE ESTIMATOR 2 


Let (Q, B, P) be a probability space on which are defined the signal 
process z,, the noise w,, the estimate 2,, and the innovations process »., 
related by 


t t 
yi = i pide qos I be Ae oe: 
0 0 


Under some mild technical assumptions, this setup has implicit in it a 
rich structure that allows us to give a ‘‘formula”’ for 2, znter alia. To 
penetrate deeper into the situation, it is convenient to introduce an 
absolutely continuous change of measure which makes the observation 
process Brownian. We shall restrict attention to the interval0 S$ ¢ < 1, 
and assume that 


1 
(i) i Zds< © as. 
0 
(ii) There is a system of increasing o-algebras 5, 0 St <1, to 
r 


which z, and w, are adapted, and w, is a Wiener process with respect 
to (P, §.). 


“3 I ae 
(ii) E exp| — / z:,dws — 5 / sds| = 1. 
0 0 


Then by Girsanov’s theorem," the observation process y, is a 
Wiener process on §, (and thus on Y = o{ys,s S -}) under the 
transformed measure Po defined on %1 by 


dPo _ ae eae ee 
WP exp| i 2,dWs oe cts| 
It is convenient to use the functional notation® 
t 1 t , 
af, g)t = exp I Fsdgs = 2 I fas} x 
0 0 
Then (dP./dP)* = q(z, y)1 > 0 as. and for A € 51 


dP dP _ 
I qtz, Y). ap dP, ape? q(z, y)1 


P(A) 


so that P< Py and so P~ Pp». The following formula for 2 is then 
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readily justified: with Y¥§ = o{y,,0 Ss S 2}, 


g, = Hole y)alYoh 
Eotatz, y)1|Yo} 
For let A € Y, so that 


Vig 
[ zap = [ agar = | Bolzae, y)1|Yb}dPo 
Enlzq, y1lYo} ; 
i Ev{a(z, iT YE} Eo{atz, Y)1|Yo}adPo 


_ ff Eotzale, ysl Ys) 
Ja Bola, waluey (9) 


since for A € Yo and a Yj-measurable function 2 integrable 


OP ie a ae 
I Wp APs = i a sF: ry dP o. 
Thus, the ratio (integrand) in eq. (9) is a version of 2,. The process 
GP xt 
Ey | ap, seh 
is a positive martingale on the past of the Brownian (under Po) motion 


y, and can therefore be expected to have a special form. It can in fact 
be shown by arguments of Shiryaev and Liptser that 


dP ‘4 
z| 7P so} = gq, Y) t. 
From this it follows that 
dP ie 
Eo | Sr. v5] = q(8, y)t. 


It is obvious intuitively that q(z, y)1 in eq. (9) can be changed to 
q(z, ye: since q(z, y). is an §,-martingale, we have a.s. 


dP 
Eo | aD: q (2, lv = 1 
dP 
Ly | Sp. | = q(z, y)t 


dP 
Ey |=: Sp. | = Eo{zq(z, y)t| Fe}. 


Hence, Yo € SF; gives 


dP 
Eo \21 a |VYor = Holzqtz, y):|Yo}, as. 
dP 5 
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Thus, the filtering 2, can be represented as 


_ Bote, y)e|Yo} 
q2, y)e 

_ Yatty, a, wr 
g, y)e 


Re 


The last equation, it can be verified, is an identity valid for any Y- 
adapted, a.s. square-integrable functional, not just . Thus, the meat 
of the formula is the numerator, as could be expected intuitively since 
the denominator is basically a normalizer. 


IX. SIGNALS SOLVING ITO DEs WITH DRIFT BASED ON FEEDBACK 
OF OBSERVATIONS 


The positive results of Clark,’ based on the assumption of indepen- 
dence between signal and noise, seem adequate for many practical 
purposes of one-way communication or detection. For the more 
general physical applications to estimation and control, involving 
feedback of observations to control the signal, it would be pleasant to 
be able to weaken this assumption and allow some physically reason- 
able dependence between signal and noise. A natural setup to investi- 
gate is a generalization of the usual Kalman filter situation, in which 
the signal and the observation each solves a stochastic DE, with 
independent driving white noises, and with the drift for the signal 
equation depending on the observations. Thus, we let the signal z, 
and observation y, respectively solve 


da, = b(t, 2, y)dt + aW, (10) 
dy: = Z,dat + dw. (11) 


The second equation is, of course, eq. (1) differentiated ; the functional 
b, causal in both z and y simultaneously, represents the deterministic 
dynamics of a system described by z, and depending on the past of 
both signal and observation. 

As noted at the end of Section I, the only way to make headway is to 
find out something about the form of the filter that gives 2,; we shall 
show that in the case of eqs. (10) and (11) an analog of the Kallianpur- 
Striebel'® formula for 2 provides enough structure on which to hang a 
proof similar to Clark’s.’ 

Let us assume, as is physically reasonable, that b(t, z, y) grows at 
most linearly with supp<;<;,|Zs|, uniformly in y. Then 


Eqlo(W, w), WiaW, w)t = 1, 
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and we can “solve” (10) and (11) by Girsanov’s theorem in such a way 
that the joint solution process (2:, yz) is absolutely continuous with 
respect to the two-dimensional Brownian motion, here (W;, w:), with 
derivative qlb(W, w), W]iq(W, w)1. It is then easily seen that 2, 
should have the form 


2, — LMAW)a Wy), WW Ws vey aa) 

SM (dW)qlb(W, y); Wa, y)t 
where M is the Wiener measure for W alone. Verification is left to the 
reader; either of two more or less equivalent methods will do: direct 
integration over Yj sets using the absolute continuity or introduction 
of P; by dP; = q(—2, w)ql—b(z, y), W]dP, and a use of it similar 
to that of Po in Section VIII. Note that P; makes (z, y) a 2-dimensional 
Brownian motion. 


Theorem 4: If z,and y, are nonanticipating solutions of eqs. (10) and (11), 
and there exists a constant K such that for x, &, and» € C[0, ~) 


[b(t, 21, »)| S K[1 + sup |2i(s)|J, (13) 
OSsSt 
| b(t, x, n) ae b(t, v, é) | = HS up ly a é| 
Sst 


then o{ys, sSt} = of{v,, s S t} (mod P). 


Remark: Eq. (13) 4 (10) and (11) have a unique causal solution. 
In fact, our argument will not devolve on whether eqs. (10) and 
(11) have a strong solution at all; the unique (in law) non- 
anticipative solution is the Girsanov solution, with derivative 
qLb(W, w), Wla(W, w), which determines 2 via eq. (12). 


Proof: With the explicit form eq. (12) for 2 available, a form of argu- 
ment previously used by the author® (and generalized from that of 
Clark’) can be used: we exhibit a sequence of v.-adapted processes 
converging to 2,; the result then follows from eq. (2): Let 


S(m, t) = { sup |W.| Sm}, m=1,2,---. 


OSsSt 
It can be seen that the approximations 
Joo, MEM aL, vy), WIA, we 


an (t) = 
i= t) M(dW)qlo(W, y), Wha (W, y)s 


= ém(y)e (14) 


approach 2; as m —, and that each one is adapted to y,; therefore, it 
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is enough to prove that each one is adapted to », itself. Now set 


A 


0 = 0, m=1,2,-:- 


t 
ye = I "ds ++, mAn 
0 


IV 
_— 


ee x bm (y™") ty 
¢ A 
yr = /, 2m(s)ds + Vey 


and note that d(y™" ~— y™) = (é™" — @,,)dt and 


t 
afi ye = af, y™) exp A fs(2™™ — 2m) sds. 
With 2". — 2, = yw” for short, we find from eq. (14) that 


mnti qlb(W, gee”); Wi.aq(W, are), 
verti= fila, 


a qLb(W, y™); Wi.a(W, y™)t 
Ar (y™) ‘ 
where 


Arf) = fi M(dw)aLb(w, A), what, Ne 


Subtracting the two fractions on the right of y7"*! above, and using 
the further abbreviation p(f, g): for gLb(f, 9), f]t; we find 


ype f, M(dW)M (dw)w, 
(m,t)?* 


p(w, y™).q(w, y™) DW, y™").q(W, y™") 


— plw, y™")a(w, y™™) pW, y™) QW, y™) 
At (y™™) Ar (y™) 


The numerator in the integrand is just 
p(w, y”).q(w, y™) p(W, y™)g(W, y™) 
[esp | f° Ler, ye) — 07, ym) Jew 
— 5 [Or ym) — BO, eyes +f Waders} 
— exp | f° Co(an ym) ~ 000, vr)aw — Ff Coe, 9 
— b(w, y”) Jds + iP wavtras| ‘ 
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so we can use the inequality |e4 — e?| S 4(e4-+ e?)|A — B| to find 
that . 
M 
wel ssf M@w)M (du) 
2 S(m,t)? 
p(w, y™)ig(w, y™) pW, yg (W, y™”) 
+ p(w, y™™)ia(w, y™™) Dp (W, y™) QW, y™) 
AM (y™") A? (y™) 


| J W07, ym) — bOW, ya 
—5 [ er, ym) — BOY, yas +f" Waperas 
— CoC, ym) = bw, y) Taw 


1 t es eee ne _ ' m,n 
+5 [ [o*(w, ym) — (ew, ym) lds — f° wap as 





The Lipschitz and growth conditions on b imply that on the range of 
integration, with z = W or w 
[i vrrar 
0 


2Ke(1 +m) [ [ye"ldu, 


|b2(x2, ym") — B(x, y)| S 2K2(1 + m) sup 
Osuss 








IIA 


where we have used d(y™" — y”) = y™ dt. Hence, with 


aan (W) = f° o(W, ym") — BOW, yw) aM, 
we find 


ivent| sme f |ype|ds + 2x20 +m) f° f” [vel duds 


DW, ym). W, y™") 
$2 fo Man) | 


p(W, y™)q(W, y™)e |, 
* Af (y™) 


Since @nn is a stochastic integral, Schwarz’s inequality implies that 


Foo, MGM na DW, vm") Ws ve 


Ze[ (wer) 


A os M (dW) p?(W, y™").e(W, ye) (15) 
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To bound the second factor on the right, we use 


t 
P(f, 9+ = (2F, 9): exp ie Ifl2ds 
and the relations 


t 
Yi = if 2™ds “+ Ve 
0 
t t 
/ W,dv = Wy: — i: vy,dWw 
0 0 


to find that on S(m, t) 
pW, go) ig’ (W, y™)e 
t t 
= g[2b(W, y™), W].q(2W, y™): exp { b(W, y™)ds + I Wias| 
0 0 


= of 20(, ys) — 2, Weexp | f° (OCW, 9"). — 2>-Fas 
0 
+ 2y,W, + 2 f W,attds + [ wi} 
S gL2b0(W, y”) — 2v, WJ: exp {3m + 2m] ».| 
t 
+ f [KG +m) + [>| Pas} 


Since the q factor on the right integrates to 1 with respect to M(dW), 
it can be seen that the square root of 


Joc, MEM PW, uP, 
is bounded by a ¢-integrable function depending on m and ». Since 
[é~"| < m, the same result holds with y”" for y™ in p and q. Also, by 
Jensen’s inequality, with S = S(m, t) 
Ary") = f M(aW)qLb(W, y™), Wa, y™). 
t t 
= M{S} exp {arts} f | for, yaw — 5 fear, was 
sLJo 0 


t 
if W.atds + ah Wedge a Wias | 
0 0 2 0 


M{S} exp {2K + m)*t—3mt — m|y;| 


+ 


IV 


t 
f M(aw) [ [b(W, y”) s = »aW,| : 
s 0 
If T = inf s: |W,| = m, the integral in the exponent is bounded by 
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the square root of 


i M(aW) if ' XpysCb(W, y") — vPds < i “TRG omy Aled Pas. 


The same argument applies to A?”(y™”), since |2%"| S< m. 
It follows that 


t t 8 
re] sme f lyeelds + 2K +m) ff lyee|duds 


+ F(t, m, nlf (f, lve"|du) ds] 


where F is a ¢t-integrable function depending only on m and »v. Thus, 
by arguments similar to those for Gronwall’s inequality, it is seen that 

” converge to zero. It follows that 2. are v.-adapted, and so is y, 
by eq. (2). 


Remark: The reader is invited to speculate on how the above proof 
would be carried out if it were postulated that the dependence of 
b(é, z, y) on y in eq. (10) came only through the estimate 2, as, for 
example, b(t, z, y) = G(t, z, 2). In this case, there is no longer a formula 
for 2, but only a functional equation. 


X. DISCUSSION OF THE GENERAL PROBLEM 


There is a general result of measure theory to the effect that a 
function x is measurable on the o-algebra induced by another function 
y, iff it is representable by an explicit composition with y, that is, as a 
function of y: x = goy. This might be called the “explicit”? function 
theorem, as opposed to the “implicit” function theorems, like Filippov’s 
lemma. For here x and y are given and ¢ is to be found, while in 
Filippov’s lemma z and ¢ are given and y Is to be found. We suggest 
that the c-algebras and innovations problems are very close in spirit 
to the ideas around the explicit function theorem. This suggestion only 
provides what we think is a hilfsaussichtspunkt; without more informa- 
tion (about 2 or a(-, -), and more insight and work, it does not help 
settle any particular case. What it helps do, though, is place the problems 
and concepts into the general framework of stochastic equations, 
especially into the circle of ideas developed by M. P. Yershov.'® See 
also Ref. 18. 

Our final results on the innovations problem will clarify the role of 
the integral equation relating y, and v,. Thus, for each t, 2, is measurable 
with respect to o{y., s S t}; hence, there is a causal functional a such 
that 2; = a(t, y), or more precisely, such that some version of 2; 
is indistinguishable from a(t, y); then the relation between y, and y, is 
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essentially 


ue [ “Gas eae (16) 


Thus, it is apparent intuitively, and can be proved, that if y, is adapted 
to v., then there is a causal solution to (4), namely y, itself, expressible 
as ¢(-,v) with ¢ causal. What we shall show is the converse, that 
causal solvability somewhere of the stochastic pe (16) implies a 
positive answer to the Frost-Kailath conjecture. In particular, we shall 
prove that of{y.,s S t} = o{vs,s < t} (mod P) for each ¢ iff on some 
probability space there is a Brownian motion W, and a causal solution 
é of 


b= ie a(s, tds + Wr, (17) 


which induces the same measure as y, does. 

This result gives a necessary and sufficient condition for Frost and 
Kailath’s innovations conjecture to hold, and it embodies the sense 
in which the innovations problem resembles the explicit function 
theorem. The direct part or necessity is obvious. For the sufficiency, 
we argue that if eq. (17) has a causal solution on some probability 
space, then it is expressible as a causal functional ¢ of a Brownian 
motion defined there. This functional can be ‘‘exported’’; i.e., it can 
be applied to any other Brownian motion on any other space to give 
a causal solution. In particular, applying it to the innovations process 
v, gives a causal solution (gv); = y(t, v), which under weak conditions 
induces the same measure as y, does. This, along with the properties 


(gv): — i a(s, gv)ds A (Ter): = vy as. (18) 


o{v,,s St} Coly,s S t}, (19) 
allows us to prove the basic property that for any integrable causal 
functional @(t, y) 

E{B(t, y) | Ys, ss t} => B(t, pv) aS. 


This result can be applied in several ways to give the desired final 
result that y, and gv. are modifications of each other. We describe 
two—one a digressive application of martingales and the other short 
and direct. 


XI. MARTINGALE ARGUMENTS USING P, 


In this section, we use some of the properties of the measure Po 
defined in Section VIII. We assume that a@ is a causal functional such 
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that 2; = a(t, y); 1.e., really such that 2, and a(é, y) are indistinguish- 
able processes. We let ©; be the Borel c-subalgebra of C[0, ©) gener- 
ated by sets of the form {x:2,C B}, s St, B Borel, and MM is 
o{v,,0 Ss S t}, the c-algebra generated by the innovations. 


Theorem 5: If there is, on some probability space, a Brownian motion W 
and a causal solution & of 


dé = a(t, &)ds + W, 


given by a causal functional ¢ as & = ¢(t, W), and if gv and y are 
identical in law, then they are modifications of each other, and 34 = Yh 
(mod P). 


The proof is the sequence of lemmas which follow. 
Lemma 1: E{ (dPo/dP)| 6} = q(aer, ov). 


Proof: Let A € 9, so that by the integral equation, A differs from 
a set of the form {Ty € B}, B € @:, by at most a null set. Then, 


[ Ps f B{ TI vl fies / (6, y)dP 
A dP yET B 
= q(agr, yr), dP = I q(agy, gv)dP. 
gvET1B vEB 


Thus, q7(a¢gv, gv): is measurable on 2 and has the same integrals 
over 9 sets as dPo/dP; thus, it is a version of E{(dPo/dP) |}. 


Lemma 2: pv; ts a No-martingale under Po. 


Proof: y, is a (Brownian) martingale under Po; since 24 C Ys, then 
/ yidPo = i) ysdP 9 if ACM and s<t. 
A A 


For A € 9, eq. (2) implies that there exists BCC, with A 
= {Ty € B}. Since y ~ ¢» in law, there follows 


i y dP I yey) = f yi {q*(z, y)e| Yo}adP 
A TyEB TyEB 


I yg, yaP = f (ov) aay, ov) dP 
TySB TevEB 


dP 
/ (oni | Fr as} dP = i (ev) dPo. 
vEB A 


Similarly, for s < t and A € %j, 


J, yaPo = (gv) dP. 
A A 


Il 
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Hence, A € MN§ implies “4 Ler: — gv, ]dPo = 0, and so gy, is a 
(95, Po)-martingale, being adapted to Xj. 


Lemma 8: Eo{y:| 96} = eve a.s. 


Proof: This is shown in the same way as Lemmas 1 and 2, by inte- 
grating over A € 9, and using the adaptedness of » to y, and the 
property Tyv = vas. 


Lemma 4: gv, ts a Brownian motion under (Po, I). 


Proof: Let A, = 2, — a(s, gv) 
t 
/ A,ds 
0 


t 8 
(ye — gr)? = 2 /, A. f A,duds. 


Yie— OY 


However, since y, and gv, are Po-martingales on Yj and I, respec- 
tively, the change of variables formula applied to each separately 
gives 


t 


t 
gvi = 2 / gvdovs + (ov). (21) 
Also 
t 
Yous = Yi — Ye f, A.ds. 


The right-hand side is a product of semi-martingales on Yo, and 
change of variables gives 


t t t 
YUtPVe = 2 I ysdys + = 7 A.dys ss : ysAcds. 
0 0 0 


Using eqs. (20) and (21), we find that 


t 8 
= Be a 
(ye pv) (pv), — t+ 2 fi ‘A Aduds 


t 8 
— 2 f E — [ Adu — ov | dys 
0 0 
(ev): = t. 


Thus, yvv, is a continuous martingale with quadratic variation ¢, and 
so a Brownian motion, on (Po, I). 


Lemma 8: Eo(y: — gt)? = 0. 
Proof: The processes y, and gv, are Brownian under Pp with respect 


to Ys and M4, respectively, so Hoy? = Hogv? = t. Thus, by Lemma 3 
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Eo(y: — gv)? = 2(¢ — Evysgv:) 

2(¢ — Bool ys| No} er) 
2(t — EX ev):) 

= 0. 


The indicated expectation exists because both y, and gy, are Brownian 
under Po, on respective algebras Yj and 95. This lemma shows that 
y and yy are modifications of each other under Pp (and so under P), 
and completes the proof of Theorem 5. 


XII. DIRECT PROOF OF THEOREM 5 


It is possible to give a short proof of Theorem 5 not depending on 
the auxiliary measure Po or the representation for 2 given in Section 
VIII. This proof depends only on egs. (18) and (19), the causality of ¢, 
and the fact that y ~ yg» in law; otherwise, it is just an exercise in 
integration. 

By hypothesis there exists a causal functional ¢, such that Tex = x 
for almost all x with respect to Wiener measure. Thus, the process 
(vv): = y(t, v) (defined on the same probability space as y, and ».) 
is identical in law to y, such that 7’ yv = v with probability one. Let 8 
be a causal functional such that £|8(t, y)| < © for each ¢. The next 
step is to prove that 


E{B(t, Y) | Ys; OSss8 t} = Bil, gv), as. 


Let then A € o{»,,0 Ss S t}. A has the form {w: vy € B} with Ba 
Borel set of C[0,t], so by the integral equation it differs from 
{w:y © T-'B} by at most a null set. Then, since gv and y are identical 
in law, and ¢ C J-! on a set of Wiener measure one, we find 


/ B(t, y)aP = ‘) B(t, gv)dP 
{yE7TB} {grET-1B} 


I B(t, gv)dP = i B(t, ev)dP 
{TevEB} {vEB} 


[8G waP 


I, B(t, gv) dP. 


Thus, @(t, gv) has the same integrals as 6(t, y) over sets defined by », 
over [0, ¢], and (since ¢ is causal) is measurable on o{v,,0 Ss S t}. 
Hence, it is a version of H{A(t, y)|»;,0 < s S t}. To complete the 
proof, let B(¢, xz) = at(t, z)?, where at = max {0, a}, to find, since 
y ~ gv in law, that 
Elat(é, y)? — at(t, gv)i|? 

= 2Kat(t, y) — 2Hat(t, gv)?tH{atd, y)?|».,0 Ss St} = 0. 
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Similarly, E|a7(t, y)? — a-(t, gv)?|? = 0. Hence, 
a(-,y) = a(-, ev) as. XX P (A = Lebesgue measure), 
so that for each t, by the integral equations, 


yi — (or): = A [a(s, y) — a(s, gv) ]ds =0 as. 


Since ¢ is causal, it follows that y; is equal almost surely to a function 
measurable on o{»,, s S ¢}. Since this is true for each ¢, it follows that 
for each t the algebras o{»,., s S t} and o{ys, s S t} are equal (mod P). 


Remark 1: Since fo! 2ds < © a.s., then if also 


t 
‘i a(s, gv)*ds <0 as., 
0 


a theorem of Kailath and Zakai will imply that the respective measures 
induced by y and ¢» are each absolutely continuous with respect to 
Wiener measure with the same Radon—Nikodym derivative g[a(z), x]. 
Hence, y ~ gy in law, as desired for the hypothesis of Theorem 5. 
Thus, the condition that y and yvv induce the same measure Is easily 
met, in comparison with the difficulty of finding a causal solution. 


Remark 2: The condition in Theorem 5 that & be causal can be replaced, 
if we are content to work only over a finite interval [0, T'], by the 
conditions that be a strong solution over [0, 7] in the sense of Ref. 
16, and that it be nonanticipative. For it has been remarked by 
Yershov® that a strong (over [0, 7]) nonanticipative solution is 
necessarily causal. 
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Timing Recovery in Digital Synchronous Data Receivers. K. H. Mueller and M. 
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criterion. 


Wide-Band Partial Discharge Detector. E. A. Franke and E. Czekaj, IEEE Trans. 
Elec. Insul., EI-10 (December 1975), pp. 112-6. A wideband (80 MHz) peak- 
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for Multilayer Boards. A. Fox, J. Test. Eval., 4, No. 1 (January 1976), pp. 74-84. 
The mechanical properties of CuBath* electroplated copper, including creep, stress- 
relaxation, monotonic tensile properties, and cyclic strain ratcheting were studied as 
a function of temperature and plating thickness to provide design information as to 
the material response of pTHs in MLBs during differential thermal expansion 
and contraction. *Sel Rex Company, the Division of Oxy Metal Finishing 
Corporation. 


PHYSICS 


Combined Radiation and Developing Laminar Free Convection Between Vertical 
Flat Plates With Asymmetric Heating. J. R. Carpenter, D. G. Briggs, and V. Sernas, 
J. Heat Transf., 98 (February 1976), pp. 95-107. A numerical investigation of 
the interaction of radiation with developing laminar free convection in vertical 
parallel plate channels with asymmetric heating is presented. Radiation to the 
inlet-exit and the cooler opposing entrance wall alters the nonradiation results by 
reducing the wall temperature by as much as 50 percent. 


Continuous Operation of 1.0-um-Wavelength GaAsi_,Sb./Al,Ga;_,As:_,Sbz Double- 
Heterostructure Injection Lasers at Room Temperature. R. E. Nahory, M. A. 
Pollack, E. D. Beebe, J. C. DeWinter, and R. W. Dixon, Appl. Phys. Lett., 28 
(January 1, 1976), pp. 19-21. Double-heterostructure GaAsi_,Sbz/Al,Gai_y- 
As}-2Sbz injection lasers have been operated continuously at room temperature for 
the first time. Emission was near 1.0 ym. The lowest threshold current density 
observed was 2.1 kA cm™ de. 


Effects of Buffer Gases on Optically Pumped CH;F FIR Laser. T. Y. Chang and 
C. Lin, J. Opt. Soc. Amer., 66, No. 4 (April 1976), pp. 362-369. The effects of 
buffer gases on the performance of a 496-um, optically pumped CH;F laser are 
studied both experimentally and theoretically. A 55-percent increase in output power 
can be obtained by adding n-hexane vapor to an equal amount of CHF. 


4F Virtual Bound State Formation in CeAl; at Low Temperatures. K. Andres, J. E. 
Graebner, and H. R. Ott, Phys. Rev. Letters, 35 (December 29, 1975), pp. 1779-1782. 
Specific-heat and electrical resistivity measurements in CeAl; below 0.2 K reveal 
enormous magnitudes of the linear specific-heat term C = y7' (y = 1620 m/mole K?) 
and the T? term in p = AT? (A = 35 wQ cm/K?). We conclude that the 4f electrons 
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ee Fermi statistics at low temperature because of the formation of virtual bound 
4f states. 


Laser Action at 12.812 um in Optically Pumped NH;. T. Y. Chang and J. D. McGee, 
Appl. Phys. Letters, 28, No. 9 (May 1, 1976), pp. 498-501. Optical pumping of 
the aR(6,0) line of the vz fundamental vibrational band of NH, has led to laser 
action on the aP(8,0) line at 12.812 um of the same vibrational band. A 5-kW pulsed 
pulp has been obtained by using 1.5 MW of pump power at 9.294 um from a COz 
aser. 


Modulation Spectroscopy at Non-Normal Incidence With Emphasis on the Vacuum-uv 
Spectral Region. D. IE. Aspnes, C. G. Olson,* and D. W. Lynch,* J. Appl. Phys., 47 
(February 1976), pp. 602-607. Expressions are given to analyze modulation 
spectra taken at non-normal incidence. These expressions are used to determine the 
optimum angle of incidence to maximize the signal-to-noise ratio. Significant. im- 
provements are shown to be obtained in the vacuum-uv spectral region by making 
measurements at relatively large angles of incidence. * Towa State University. 


New Nanosecond Continuum for Excited-State Spectroscopy. C. Lin and R. H. 
Stolen, Appl. Phys. Lett., 28 (February 15, 1976), pp. 216-218. A new nano- 
second continuum is generated by nonlinear optical processes in fiber waveguides 
pumped with a 20-kW 10-ns dye-laser pulse of broad spectral width. The continuum 
bandwidth is several thousand cm™! in the visible with a total power ~1 kW. This 
continuum is useful for nanosecond time-resolved excited-state spectroscopy. 


Strength of 0.04-50-m Lengths of Coated Fused Silica Fibers. C. R. Kurkjian, R. V. 
Albarino, J. T. Krause, H. N. Vazirani, F. V. DiMarcello, S. Torza, and H. Schon- 
horn, Appl. Phys. Lett., 28, No. 10 (May 15, 1976), pp. 588-590. Tensile strength 
measurements have been made on coated fused silica fibers at gage lengths up to 
50 m. At gage lengths between 0.04 and 10 m, the average strength is essentially 
constant at about 5 GN/m? (700 ksi). At gage lengths greater than 20 m the length 
dependence becomes important and by 50 m the strength has decreased to about 
2 GN/m? (300 ksi). The minimum strength recorded in the testing of samples taken 
continuously from a length of 1031.6 m was 0.43 GN/m? (64 ksi). A ‘‘weakest link” 
model appears to be obeyed. 


Virtual Bound States of Pd in Cu, Ag, and Au, and of Pt in Ag. 8. Hiifner, G. K. 
Wertheim, and J. H. Wernick, Solid State Commun., 14 (December 15, 1975), pp. 
1585-1590. The positions and widths of the virtual bound states of Pd in Cu, 
Ag, and Au and of Pt in Ag have been measured by XPS. The spin-orbit splitting 
of the Pt state in Ag is comparable to the spectroscopic atomic value. The host 
lattice d-band structure is perturbed in the alloys. 


SYSTEMS ENGINEERING AND OPERATIONAL RESEARCH 


Combined Primal-Dual and Penalty Methods for Convex Programming. B. W. 
Kort and D. P. Bertsekas, Siam J. Contr. Optim., 74 (February 1976), pp. 268-294. 
We propose and analyze a class of combined primal-dual and penalty methods for 
constrained minimization which generalizes the method of multipliers. We provide 
convergence and rate-of-convergence analysis for these methods for the case of a 
convex programming problem. We prove global convergence in the presence of both 
exact and inexact unconstrained minimization, and we show that the rate of con- 
vergence may be linear or superlinear with arbitrary Q-order of convergence, depend- 
ing on the problem at hand and the form of the penalty function employed. 
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